Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Save tonysy/bedf4f876f6b6646efc51f9f761682be to your computer and use it in GitHub Desktop.
Save tonysy/bedf4f876f6b6646efc51f9f761682be to your computer and use it in GitHub Desktop.
This file has been truncated, but you can view the full file.
[03/27 20:46:20] pyaction INFO: Running with full config:
{'AVA': {'ANNOTATION_DIR': '/public/sist/home/hexm/Projects/pyaction/data/ava/annotations/',
'BGR': False,
'DETECTION_SCORE_THRESH': 0.8,
'EXCLUSION_FILE': 'ava_val_excluded_timestamps_v2.2.csv',
'FRAME_DIR': '/public/sist/home/hexm/Projects/pyaction/data/ava/frames',
'FRAME_LIST_DIR': '/public/sist/home/hexm/Projects/pyaction/data/ava/frame_lists',
'FULL_TEST_ON_VAL': False,
'GROUNDTRUTH_FILE': 'ava_val_v2.2.csv',
'IMG_PROC_BACKEND': 'cv2',
'LABEL_MAP_FILE': 'ava_action_list_v2.2_for_activitynet_2019.pbtxt',
'TEST_FORCE_FLIP': False,
'TEST_LISTS': ['val.csv'],
'TEST_PREDICT_BOX_LISTS': ['person_box_67091280_iou90/ava_detection_val_boxes_and_labels.csv'],
'TRAIN_GT_BOX_LISTS': ['ava_train_v2.2.csv'],
'TRAIN_LISTS': ['train.csv'],
'TRAIN_PCA_EIGVAL': [0.225, 0.224, 0.229],
'TRAIN_PCA_EIGVEC': [[-0.5675, 0.7192, 0.4009],
[-0.5808, -0.0045, -0.814],
[-0.5836, -0.6948, 0.4203]],
'TRAIN_PCA_JITTER_ONLY': True,
'TRAIN_PREDICT_BOX_LISTS': ['ava_train_v2.2.csv',
'person_box_67091280_iou90/ava_detection_train_boxes_and_labels_include_negative_v2.2.csv'],
'TRAIN_USE_COLOR_AUGMENTATION': False},
'BN': {'EPSILON': 1e-05,
'MOMENTUM': 0.1,
'NUM_BATCHES_PRECISE': 200,
'USE_PRECISE_STATS': False,
'WEIGHT_DECAY': 0.0},
'DATA': {'CROP_SIZE': 224,
'INPUT_CHANNEL_NUM': [3, 3],
'MEAN': [0.45, 0.45, 0.45],
'NUM_FRAMES': 32,
'PATH_PREFIX': '',
'PATH_TO_DATA_DIR': '',
'SAMPLING_RATE': 2,
'STD': [0.225, 0.225, 0.225],
'TEST_CROP_SIZE': 256,
'TRAIN_CROP_SIZE': 224,
'TRAIN_JITTER_SCALES': [256, 320]},
'DATA_LOADER': {'ENABLE_MULTI_THREAD_DECODE': False,
'NUM_WORKERS': 16,
'PIN_MEMORY': True},
'DETECTION': {'ALIGNED': True,
'ENABLE': True,
'ROI_XFORM_RESOLUTION': 7,
'SPATIAL_SCALE_FACTOR': 16},
'DIST_BACKEND': 'nccl',
'LOG_PERIOD': 10,
'MODEL': {'ARCH': 'slowfast',
'DROPOUT_RATE': 0.5,
'FC_INIT_STD': 0.01,
'LOSS_FUNC': 'bce',
'MULTI_PATHWAY_ARCH': ['slowfast'],
'NUM_CLASSES': 80,
'SINGLE_PATHWAY_ARCH': ['c2d', 'i3d', 'slowonly']},
'NONLOCAL': {'GROUP': [[1, 1], [1, 1], [1, 1], [1, 1]],
'INSTANTIATION': 'dot_product',
'LOCATION': [[[], []], [[], []], [[], []], [[], []]],
'POOL': [[[1, 2, 2], [1, 2, 2]],
[[1, 2, 2], [1, 2, 2]],
[[1, 2, 2], [1, 2, 2]],
[[1, 2, 2], [1, 2, 2]]]},
'NUM_GPUS': 4,
'NUM_SHARDS': 1,
'OUTPUT_DIR': '/public/sist/home/hexm/Models/pyaction/model_logs/ava/slowfast.ava.32x2.res50.short',
'RESNET': {'DEPTH': 50,
'INPLACE_RELU': True,
'NUM_BLOCK_TEMP_KERNEL': [[3, 3], [4, 4], [6, 6], [3, 3]],
'NUM_GROUPS': 1,
'SPATIAL_DILATIONS': [[1, 1], [1, 1], [1, 1], [2, 2]],
'SPATIAL_STRIDES': [[1, 1], [2, 2], [2, 2], [1, 1]],
'STRIDE_1X1': False,
'TRANS_FUNC': 'bottleneck_transform',
'WIDTH_PER_GROUP': 64,
'ZERO_INIT_FINAL_BN': True},
'RNG_SEED': 0,
'SHARD_ID': 0,
'SLOWFAST': {'ALPHA': 4,
'BETA_INV': 8,
'FUSION_CONV_CHANNEL_RATIO': 2,
'FUSION_KERNEL_SZ': 7},
'SOLVER': {'BASE_LR': 0.05,
'DAMPENING': 0.0,
'GAMMA': 0.1,
'LRS': [1, 0.1, 0.01, 0.001],
'LR_POLICY': 'steps_with_relative_lrs',
'MAX_EPOCH': 20,
'MOMENTUM': 0.9,
'NESTEROV': True,
'OPTIMIZING_METHOD': 'sgd',
'STEPS': [0, 10, 15, 20],
'STEP_SIZE': 1,
'WARMUP_EPOCHS': 5,
'WARMUP_FACTOR': 0.1,
'WARMUP_START_LR': 0.000125,
'WEIGHT_DECAY': 1e-07},
'TEST': {'BATCH_SIZE': 4,
'CHECKPOINT_FILE_PATH': '',
'CHECKPOINT_TYPE': 'pytorch',
'DATASET': 'ava',
'ENABLE': True,
'NUM_ENSEMBLE_VIEWS': 10,
'NUM_SPATIAL_CROPS': 3},
'TRAIN': {'AUTO_RESUME': True,
'BATCH_SIZE': 32,
'CHECKPOINT_FILE_PATH': '/public/sist/home/hexm/Projects/pyaction/model_zoo/ava/pretrain/SLOWFAST_8x8_R50.pkl',
'CHECKPOINT_INFLATE': False,
'CHECKPOINT_PERIOD': 1,
'CHECKPOINT_TYPE': 'caffe2',
'DATASET': 'ava',
'ENABLE': True,
'EVAL_PERIOD': 5}}
[03/27 20:46:20] pyaction INFO: different config with base class:
{'AVA': {'ANNOTATION_DIR': '/public/sist/home/hexm/Projects/pyaction/data/ava/annotations/',
'BGR': False,
'DETECTION_SCORE_THRESH': 0.8,
'EXCLUSION_FILE': 'ava_val_excluded_timestamps_v2.2.csv',
'FRAME_DIR': '/public/sist/home/hexm/Projects/pyaction/data/ava/frames',
'FRAME_LIST_DIR': '/public/sist/home/hexm/Projects/pyaction/data/ava/frame_lists',
'FULL_TEST_ON_VAL': False,
'GROUNDTRUTH_FILE': 'ava_val_v2.2.csv',
'IMG_PROC_BACKEND': 'cv2',
'LABEL_MAP_FILE': 'ava_action_list_v2.2_for_activitynet_2019.pbtxt',
'TEST_FORCE_FLIP': False,
'TEST_LISTS': ['val.csv'],
'TEST_PREDICT_BOX_LISTS': ['person_box_67091280_iou90/ava_detection_val_boxes_and_labels.csv'],
'TRAIN_GT_BOX_LISTS': ['ava_train_v2.2.csv'],
'TRAIN_LISTS': ['train.csv'],
'TRAIN_PCA_EIGVAL': [0.225, 0.224, 0.229],
'TRAIN_PCA_EIGVEC': [[-0.5675, 0.7192, 0.4009],
[-0.5808, -0.0045, -0.814],
[-0.5836, -0.6948, 0.4203]],
'TRAIN_PCA_JITTER_ONLY': True,
'TRAIN_PREDICT_BOX_LISTS': ['ava_train_v2.2.csv',
'person_box_67091280_iou90/ava_detection_train_boxes_and_labels_include_negative_v2.2.csv'],
'TRAIN_USE_COLOR_AUGMENTATION': False},
'DATA': {'NUM_FRAMES': 32, 'SAMPLING_RATE': 2},
'DATA_LOADER': {'NUM_WORKERS': 16},
'DETECTION': {'ALIGNED': True,
'ENABLE': True,
'ROI_XFORM_RESOLUTION': 7,
'SPATIAL_SCALE_FACTOR': 16},
'MODEL': {'LOSS_FUNC': 'bce', 'NUM_CLASSES': 80},
'NONLOCAL': {'GROUP': [[1, 1], [1, 1], [1, 1], [1, 1]],
'LOCATION': [[[], []], [[], []], [[], []], [[], []]]},
'NUM_GPUS': 4,
'OUTPUT_DIR': '/public/sist/home/hexm/Models/pyaction/model_logs/ava/slowfast.ava.32x2.res50.short',
'RESNET': {'NUM_BLOCK_TEMP_KERNEL': [[3, 3], [4, 4], [6, 6], [3, 3]],
'SPATIAL_DILATIONS': [[1, 1], [1, 1], [1, 1], [2, 2]],
'SPATIAL_STRIDES': [[1, 1], [2, 2], [2, 2], [1, 1]],
'ZERO_INIT_FINAL_BN': True},
'RNG_SEED': 0,
'SLOWFAST': {'ALPHA': 4,
'BETA_INV': 8,
'FUSION_CONV_CHANNEL_RATIO': 2,
'FUSION_KERNEL_SZ': 7},
'SOLVER': {'BASE_LR': 0.05,
'LRS': [1, 0.1, 0.01, 0.001],
'LR_POLICY': 'steps_with_relative_lrs',
'MAX_EPOCH': 20,
'STEPS': [0, 10, 15, 20],
'WARMUP_EPOCHS': 5,
'WARMUP_START_LR': 0.000125,
'WEIGHT_DECAY': 1e-07},
'TEST': {'BATCH_SIZE': 4, 'DATASET': 'ava'},
'TRAIN': {'BATCH_SIZE': 32,
'CHECKPOINT_FILE_PATH': '/public/sist/home/hexm/Projects/pyaction/model_zoo/ava/pretrain/SLOWFAST_8x8_R50.pkl',
'CHECKPOINT_TYPE': 'caffe2',
'DATASET': 'ava',
'EVAL_PERIOD': 5}}
[03/27 20:46:54] pa.utils.misc INFO: Model:
SlowFastModel(
(s1): VideoModelStem(
(pathway0_stem): ResNetBasicStem(
(conv): Conv3d(3, 64, kernel_size=[1, 7, 7], stride=[1, 2, 2], padding=[0, 3, 3], bias=False)
(bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(relu): ReLU(inplace=True)
(pool_layer): MaxPool3d(kernel_size=[1, 3, 3], stride=[1, 2, 2], padding=[0, 1, 1], dilation=1, ceil_mode=False)
)
(pathway1_stem): ResNetBasicStem(
(conv): Conv3d(3, 8, kernel_size=[5, 7, 7], stride=[1, 2, 2], padding=[2, 3, 3], bias=False)
(bn): BatchNorm3d(8, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(relu): ReLU(inplace=True)
(pool_layer): MaxPool3d(kernel_size=[1, 3, 3], stride=[1, 2, 2], padding=[0, 1, 1], dilation=1, ceil_mode=False)
)
)
(s1_fuse): FuseFastToSlow(
(conv_f2s): Conv3d(8, 16, kernel_size=[7, 1, 1], stride=[4, 1, 1], padding=[3, 0, 0], bias=False)
(bn): BatchNorm3d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(relu): ReLU(inplace=True)
)
(s2): ResStage(
(pathway0_res0): ResBlock(
(branch1): Conv3d(80, 256, kernel_size=(1, 1, 1), stride=[1, 1, 1], bias=False)
(branch1_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(branch2): BottleneckTransform(
(a): Conv3d(80, 64, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(a_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(64, 64, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(64, 256, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res1): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(256, 64, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(a_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(64, 64, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(64, 256, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res2): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(256, 64, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(a_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(64, 64, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(64, 256, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res0): ResBlock(
(branch1): Conv3d(8, 32, kernel_size=(1, 1, 1), stride=[1, 1, 1], bias=False)
(branch1_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(branch2): BottleneckTransform(
(a): Conv3d(8, 8, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(8, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(8, 8, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(8, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(8, 32, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res1): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(32, 8, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(8, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(8, 8, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(8, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(8, 32, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res2): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(32, 8, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(8, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(8, 8, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(8, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(8, 32, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
)
(s2_fuse): FuseFastToSlow(
(conv_f2s): Conv3d(32, 64, kernel_size=[7, 1, 1], stride=[4, 1, 1], padding=[3, 0, 0], bias=False)
(bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(relu): ReLU(inplace=True)
)
(pathway0_pool): MaxPool3d(kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], dilation=1, ceil_mode=False)
(pathway1_pool): MaxPool3d(kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], dilation=1, ceil_mode=False)
(s3): ResStage(
(pathway0_res0): ResBlock(
(branch1): Conv3d(320, 512, kernel_size=(1, 1, 1), stride=[1, 2, 2], bias=False)
(branch1_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(branch2): BottleneckTransform(
(a): Conv3d(320, 128, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(a_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(128, 128, kernel_size=[1, 3, 3], stride=[1, 2, 2], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(128, 512, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res1): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(512, 128, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(a_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(128, 128, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(128, 512, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res2): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(512, 128, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(a_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(128, 128, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(128, 512, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res3): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(512, 128, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(a_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(128, 128, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(128, 512, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res0): ResBlock(
(branch1): Conv3d(32, 64, kernel_size=(1, 1, 1), stride=[1, 2, 2], bias=False)
(branch1_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(branch2): BottleneckTransform(
(a): Conv3d(32, 16, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(16, 16, kernel_size=[1, 3, 3], stride=[1, 2, 2], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(16, 64, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res1): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(64, 16, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(16, 16, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(16, 64, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res2): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(64, 16, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(16, 16, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(16, 64, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res3): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(64, 16, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(16, 16, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(16, 64, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
)
(s3_fuse): FuseFastToSlow(
(conv_f2s): Conv3d(64, 128, kernel_size=[7, 1, 1], stride=[4, 1, 1], padding=[3, 0, 0], bias=False)
(bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(relu): ReLU(inplace=True)
)
(s4): ResStage(
(pathway0_res0): ResBlock(
(branch1): Conv3d(640, 1024, kernel_size=(1, 1, 1), stride=[1, 2, 2], bias=False)
(branch1_bn): BatchNorm3d(1024, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(branch2): BottleneckTransform(
(a): Conv3d(640, 256, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(256, 256, kernel_size=[1, 3, 3], stride=[1, 2, 2], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(256, 1024, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(1024, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res1): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(1024, 256, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(256, 256, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(256, 1024, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(1024, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res2): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(1024, 256, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(256, 256, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(256, 1024, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(1024, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res3): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(1024, 256, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(256, 256, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(256, 1024, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(1024, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res4): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(1024, 256, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(256, 256, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(256, 1024, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(1024, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res5): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(1024, 256, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(256, 256, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(256, 1024, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(1024, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res0): ResBlock(
(branch1): Conv3d(64, 128, kernel_size=(1, 1, 1), stride=[1, 2, 2], bias=False)
(branch1_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(branch2): BottleneckTransform(
(a): Conv3d(64, 32, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(32, 32, kernel_size=[1, 3, 3], stride=[1, 2, 2], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(32, 128, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res1): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(128, 32, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(32, 32, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(32, 128, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res2): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(128, 32, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(32, 32, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(32, 128, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res3): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(128, 32, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(32, 32, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(32, 128, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res4): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(128, 32, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(32, 32, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(32, 128, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res5): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(128, 32, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(32, 32, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 1, 1], dilation=[1, 1, 1], bias=False)
(b_bn): BatchNorm3d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(32, 128, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(128, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
)
(s4_fuse): FuseFastToSlow(
(conv_f2s): Conv3d(128, 256, kernel_size=[7, 1, 1], stride=[4, 1, 1], padding=[3, 0, 0], bias=False)
(bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(relu): ReLU(inplace=True)
)
(s5): ResStage(
(pathway0_res0): ResBlock(
(branch1): Conv3d(1280, 2048, kernel_size=(1, 1, 1), stride=[1, 1, 1], bias=False)
(branch1_bn): BatchNorm3d(2048, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(branch2): BottleneckTransform(
(a): Conv3d(1280, 512, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(512, 512, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 2, 2], dilation=[1, 2, 2], bias=False)
(b_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(512, 2048, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(2048, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res1): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(2048, 512, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(512, 512, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 2, 2], dilation=[1, 2, 2], bias=False)
(b_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(512, 2048, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(2048, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway0_res2): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(2048, 512, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(512, 512, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 2, 2], dilation=[1, 2, 2], bias=False)
(b_bn): BatchNorm3d(512, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(512, 2048, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(2048, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res0): ResBlock(
(branch1): Conv3d(128, 256, kernel_size=(1, 1, 1), stride=[1, 1, 1], bias=False)
(branch1_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(branch2): BottleneckTransform(
(a): Conv3d(128, 64, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(64, 64, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 2, 2], dilation=[1, 2, 2], bias=False)
(b_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(64, 256, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res1): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(256, 64, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(64, 64, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 2, 2], dilation=[1, 2, 2], bias=False)
(b_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(64, 256, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
(pathway1_res2): ResBlock(
(branch2): BottleneckTransform(
(a): Conv3d(256, 64, kernel_size=[3, 1, 1], stride=[1, 1, 1], padding=[1, 0, 0], bias=False)
(a_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(a_relu): ReLU(inplace=True)
(b): Conv3d(64, 64, kernel_size=[1, 3, 3], stride=[1, 1, 1], padding=[0, 2, 2], dilation=[1, 2, 2], bias=False)
(b_bn): BatchNorm3d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(b_relu): ReLU(inplace=True)
(c): Conv3d(64, 256, kernel_size=[1, 1, 1], stride=[1, 1, 1], padding=[0, 0, 0], bias=False)
(c_bn): BatchNorm3d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(relu): ReLU(inplace=True)
)
)
(head): ResNetRoIHead(
(s0_tpool): AvgPool3d(kernel_size=[8, 1, 1], stride=1, padding=0)
(s0_roi): ROIAlign(output_size=[7, 7], spatial_scale=0.0625, sampling_ratio=0, aligned=True)
(s0_spool): MaxPool2d(kernel_size=[7, 7], stride=1, padding=0, dilation=1, ceil_mode=False)
(s1_tpool): AvgPool3d(kernel_size=[32, 1, 1], stride=1, padding=0)
(s1_roi): ROIAlign(output_size=[7, 7], spatial_scale=0.0625, sampling_ratio=0, aligned=True)
(s1_spool): MaxPool2d(kernel_size=[7, 7], stride=1, padding=0, dilation=1, ceil_mode=False)
(dropout): Dropout(p=0.5, inplace=False)
(projection): Linear(in_features=2304, out_features=80, bias=True)
(act): Sigmoid()
)
)
[03/27 20:46:54] pa.utils.misc INFO: Params: 33,828,888
[03/27 20:46:54] pa.utils.misc INFO: Mem: 260.05322265625 MB
[03/27 20:46:54] pa.utils.misc INFO: FLOPs: 74.18038272 GFLOPs
[03/27 20:46:54] pa.utils.misc INFO: nvidia-smi
[03/27 20:46:54] pyaction INFO: Load from given checkpoint file.
[03/27 20:46:55] pa.utils.checkpoint INFO: res_conv1_bn_b: (64,) => s1.pathway0_stem.bn.bias: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res5_2_branch2a_bn_riv: (512,) => s5.pathway0_res2.branch2.a_bn.running_var: (512,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_1_branch2c_bn_riv: (128,) => s4.pathway1_res1.branch2.c_bn.running_var: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res3_2_branch2b_bn_rm: (16,) => s3.pathway1_res2.branch2.b_bn.running_mean: (16,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res_conv1_bn_s: (64,) => s1.pathway0_stem.bn.weight: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res3_3_branch2b_bn_riv: (16,) => s3.pathway1_res3.branch2.b_bn.running_var: (16,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res4_4_branch2b_w: (256, 256, 1, 3, 3) => s4.pathway0_res4.branch2.b.weight: (256, 256, 1, 3, 3)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res2_1_branch2b_w: (8, 8, 1, 3, 3) => s2.pathway1_res1.branch2.b.weight: (8, 8, 1, 3, 3)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res5_1_branch2b_bn_riv: (64,) => s5.pathway1_res1.branch2.b_bn.running_var: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_2_branch2a_bn_riv: (32,) => s4.pathway1_res2.branch2.a_bn.running_var: (32,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res3_1_branch2c_bn_s: (512,) => s3.pathway0_res1.branch2.c_bn.weight: (512,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res5_1_branch2b_w: (512, 512, 1, 3, 3) => s5.pathway0_res1.branch2.b.weight: (512, 512, 1, 3, 3)
[03/27 20:46:55] pa.utils.checkpoint INFO: res3_0_branch2a_bn_riv: (128,) => s3.pathway0_res0.branch2.a_bn.running_var: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res2_2_branch2a_bn_rm: (8,) => s2.pathway1_res2.branch2.a_bn.running_mean: (8,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res5_0_branch1_bn_rm: (256,) => s5.pathway1_res0.branch1_bn.running_mean: (256,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res2_1_branch2b_bn_rm: (8,) => s2.pathway1_res1.branch2.b_bn.running_mean: (8,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res3_3_branch2c_bn_riv: (64,) => s3.pathway1_res3.branch2.c_bn.running_var: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res2_2_branch2b_bn_rm: (64,) => s2.pathway0_res2.branch2.b_bn.running_mean: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_3_branch2c_bn_rm: (128,) => s4.pathway1_res3.branch2.c_bn.running_mean: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res3_3_branch2c_bn_riv: (512,) => s3.pathway0_res3.branch2.c_bn.running_var: (512,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res5_0_branch2c_bn_riv: (2048,) => s5.pathway0_res0.branch2.c_bn.running_var: (2048,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res3_2_branch2a_w: (128, 512, 1, 1, 1) => s3.pathway0_res2.branch2.a.weight: (128, 512, 1, 1, 1)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_5_branch2b_w: (32, 32, 1, 3, 3) => s4.pathway1_res5.branch2.b.weight: (32, 32, 1, 3, 3)
[03/27 20:46:55] pa.utils.checkpoint INFO: res3_0_branch1_bn_b: (512,) => s3.pathway0_res0.branch1_bn.bias: (512,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_pool1_subsample_bn_rm: (16,) => s1_fuse.bn.running_mean: (16,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res5_2_branch2c_bn_riv: (256,) => s5.pathway1_res2.branch2.c_bn.running_var: (256,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res3_2_branch2c_w: (512, 128, 1, 1, 1) => s3.pathway0_res2.branch2.c.weight: (512, 128, 1, 1, 1)
[03/27 20:46:55] pa.utils.checkpoint INFO: res2_2_branch2a_bn_riv: (64,) => s2.pathway0_res2.branch2.a_bn.running_var: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res5_2_branch2a_bn_rm: (512,) => s5.pathway0_res2.branch2.a_bn.running_mean: (512,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res3_2_branch2b_bn_b: (16,) => s3.pathway1_res2.branch2.b_bn.bias: (16,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res3_1_branch2a_bn_b: (16,) => s3.pathway1_res1.branch2.a_bn.bias: (16,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res4_5_branch2a_w: (256, 1024, 3, 1, 1) => s4.pathway0_res5.branch2.a.weight: (256, 1024, 3, 1, 1)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res5_0_branch2c_bn_riv: (256,) => s5.pathway1_res0.branch2.c_bn.running_var: (256,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res2_0_branch2a_w: (8, 8, 3, 1, 1) => s2.pathway1_res0.branch2.a.weight: (8, 8, 3, 1, 1)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_2_branch2c_w: (128, 32, 1, 1, 1) => s4.pathway1_res2.branch2.c.weight: (128, 32, 1, 1, 1)
[03/27 20:46:55] pa.utils.checkpoint INFO: res2_2_branch2a_bn_b: (64,) => s2.pathway0_res2.branch2.a_bn.bias: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res5_2_branch2c_bn_rm: (256,) => s5.pathway1_res2.branch2.c_bn.running_mean: (256,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res2_2_branch2a_bn_s: (64,) => s2.pathway0_res2.branch2.a_bn.weight: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_1_branch2b_w: (32, 32, 1, 3, 3) => s4.pathway1_res1.branch2.b.weight: (32, 32, 1, 3, 3)
[03/27 20:46:55] pa.utils.checkpoint INFO: res5_1_branch2c_w: (2048, 512, 1, 1, 1) => s5.pathway0_res1.branch2.c.weight: (2048, 512, 1, 1, 1)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res5_2_branch2a_bn_s: (64,) => s5.pathway1_res2.branch2.a_bn.weight: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_0_branch2c_bn_s: (128,) => s4.pathway1_res0.branch2.c_bn.weight: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res5_1_branch2c_bn_rm: (2048,) => s5.pathway0_res1.branch2.c_bn.running_mean: (2048,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res5_2_branch2a_bn_b: (64,) => s5.pathway1_res2.branch2.a_bn.bias: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res3_0_branch2c_bn_riv: (64,) => s3.pathway1_res0.branch2.c_bn.running_var: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_0_branch2c_bn_b: (128,) => s4.pathway1_res0.branch2.c_bn.bias: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res4_0_branch2b_w: (256, 256, 1, 3, 3) => s4.pathway0_res0.branch2.b.weight: (256, 256, 1, 3, 3)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_2_branch2c_bn_rm: (128,) => s4.pathway1_res2.branch2.c_bn.running_mean: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_1_branch2b_bn_riv: (32,) => s4.pathway1_res1.branch2.b_bn.running_var: (32,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res3_2_branch2a_bn_b: (128,) => s3.pathway0_res2.branch2.a_bn.bias: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res2_1_branch2b_bn_b: (8,) => s2.pathway1_res1.branch2.b_bn.bias: (8,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res4_4_branch2b_bn_b: (256,) => s4.pathway0_res4.branch2.b_bn.bias: (256,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_0_branch1_bn_rm: (128,) => s4.pathway1_res0.branch1_bn.running_mean: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res5_0_branch2b_bn_riv: (64,) => s5.pathway1_res0.branch2.b_bn.running_var: (64,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res3_1_branch2c_bn_b: (512,) => s3.pathway0_res1.branch2.c_bn.bias: (512,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res4_0_branch2c_bn_riv: (1024,) => s4.pathway0_res0.branch2.c_bn.running_var: (1024,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_1_branch2a_bn_s: (32,) => s4.pathway1_res1.branch2.a_bn.weight: (32,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res3_3_branch2b_bn_riv: (128,) => s3.pathway0_res3.branch2.b_bn.running_var: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res5_1_branch2a_bn_riv: (512,) => s5.pathway0_res1.branch2.a_bn.running_var: (512,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res4_4_branch2b_bn_s: (256,) => s4.pathway0_res4.branch2.b_bn.weight: (256,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res5_0_branch1_bn_rm: (2048,) => s5.pathway0_res0.branch1_bn.running_mean: (2048,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_1_branch2a_bn_b: (32,) => s4.pathway1_res1.branch2.a_bn.bias: (32,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res3_3_branch2c_bn_subsample_bn_s: (128,) => s3_fuse.bn.weight: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res4_4_branch2a_bn_rm: (256,) => s4.pathway0_res4.branch2.a_bn.running_mean: (256,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res3_0_branch1_w: (512, 320, 1, 1, 1) => s3.pathway0_res0.branch1.weight: (512, 320, 1, 1, 1)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res3_0_branch2b_bn_rm: (16,) => s3.pathway1_res0.branch2.b_bn.running_mean: (16,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res3_3_branch2c_bn_subsample_bn_b: (128,) => s3_fuse.bn.bias: (128,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res5_0_branch2a_bn_riv: (512,) => s5.pathway0_res0.branch2.a_bn.running_var: (512,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res4_4_branch2a_bn_riv: (32,) => s4.pathway1_res4.branch2.a_bn.running_var: (32,)
[03/27 20:46:55] pa.utils.checkpoint INFO: res4_5_branch2c_bn_riv: (1024,) => s4.pathway0_res5.branch2.c_bn.running_var: (1024,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res2_0_branch2c_bn_riv: (32,) => s2.pathway1_res0.branch2.c_bn.running_var: (32,)
[03/27 20:46:55] pa.utils.checkpoint INFO: t_res2_2_branch2c_bn_s: (32,) => s2.pathway1_res2.branch2.c_bn.weight: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_3_branch2a_bn_riv: (32,) => s4.pathway1_res3.branch2.a_bn.running_var: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_5_branch2a_bn_s: (32,) => s4.pathway1_res5.branch2.a_bn.weight: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_0_branch2b_bn_b: (8,) => s2.pathway1_res0.branch2.b_bn.bias: (8,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_2_branch2c_bn_b: (32,) => s2.pathway1_res2.branch2.c_bn.bias: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_4_branch2b_bn_rm: (32,) => s4.pathway1_res4.branch2.b_bn.running_mean: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_2_branch2b_bn_b: (8,) => s2.pathway1_res2.branch2.b_bn.bias: (8,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_0_branch1_bn_rm: (512,) => s3.pathway0_res0.branch1_bn.running_mean: (512,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_1_branch2c_w: (1024, 256, 1, 1, 1) => s4.pathway0_res1.branch2.c.weight: (1024, 256, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res2_1_branch2c_bn_riv: (256,) => s2.pathway0_res1.branch2.c_bn.running_var: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res5_2_branch2b_w: (64, 64, 1, 3, 3) => s5.pathway1_res2.branch2.b.weight: (64, 64, 1, 3, 3)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_0_branch2c_w: (32, 8, 1, 1, 1) => s2.pathway1_res0.branch2.c.weight: (32, 8, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_5_branch2c_w: (1024, 256, 1, 1, 1) => s4.pathway0_res5.branch2.c.weight: (1024, 256, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_3_branch2c_bn_b: (512,) => s3.pathway0_res3.branch2.c_bn.bias: (512,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_2_branch2c_bn_rm: (64,) => s3.pathway1_res2.branch2.c_bn.running_mean: (64,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_0_branch1_bn_riv: (128,) => s4.pathway1_res0.branch1_bn.running_var: (128,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res5_1_branch2b_bn_b: (64,) => s5.pathway1_res1.branch2.b_bn.bias: (64,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res5_1_branch2b_w: (64, 64, 1, 3, 3) => s5.pathway1_res1.branch2.b.weight: (64, 64, 1, 3, 3)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_3_branch2c_bn_s: (512,) => s3.pathway0_res3.branch2.c_bn.weight: (512,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_2_branch2b_w: (128, 128, 1, 3, 3) => s3.pathway0_res2.branch2.b.weight: (128, 128, 1, 3, 3)
[03/27 20:46:56] pa.utils.checkpoint INFO: res5_1_branch2a_w: (512, 2048, 3, 1, 1) => s5.pathway0_res1.branch2.a.weight: (512, 2048, 3, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_3_branch2a_bn_riv: (16,) => s3.pathway1_res3.branch2.a_bn.running_var: (16,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_2_branch2c_w: (32, 8, 1, 1, 1) => s2.pathway1_res2.branch2.c.weight: (32, 8, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_5_branch2c_bn_subsample_bn_b: (256,) => s4_fuse.bn.bias: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_5_branch2b_bn_b: (256,) => s4.pathway0_res5.branch2.b_bn.bias: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_2_branch2a_bn_riv: (16,) => s3.pathway1_res2.branch2.a_bn.running_var: (16,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_2_branch2a_w: (8, 32, 3, 1, 1) => s2.pathway1_res2.branch2.a.weight: (8, 32, 3, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res2_0_branch1_bn_s: (256,) => s2.pathway0_res0.branch1_bn.weight: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_0_branch2b_bn_s: (8,) => s2.pathway1_res0.branch2.b_bn.weight: (8,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_5_branch2c_bn_subsample_bn_s: (256,) => s4_fuse.bn.weight: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_5_branch2b_bn_s: (256,) => s4.pathway0_res5.branch2.b_bn.weight: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_1_branch2b_bn_rm: (16,) => s3.pathway1_res1.branch2.b_bn.running_mean: (16,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res5_2_branch2b_bn_riv: (64,) => s5.pathway1_res2.branch2.b_bn.running_var: (64,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_4_branch2c_bn_b: (1024,) => s4.pathway0_res4.branch2.c_bn.bias: (1024,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_1_branch2c_bn_b: (32,) => s2.pathway1_res1.branch2.c_bn.bias: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_3_branch2b_bn_s: (128,) => s3.pathway0_res3.branch2.b_bn.weight: (128,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_2_branch2c_bn_riv: (64,) => s3.pathway1_res2.branch2.c_bn.running_var: (64,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res2_1_branch2a_bn_rm: (64,) => s2.pathway0_res1.branch2.a_bn.running_mean: (64,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_5_branch2c_bn_subsample_bn_rm: (256,) => s4_fuse.bn.running_mean: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res5_1_branch2b_bn_b: (512,) => s5.pathway0_res1.branch2.b_bn.bias: (512,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_1_branch2a_bn_riv: (128,) => s3.pathway0_res1.branch2.a_bn.running_var: (128,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res5_1_branch2b_bn_s: (512,) => s5.pathway0_res1.branch2.b_bn.weight: (512,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_4_branch2c_w: (1024, 256, 1, 1, 1) => s4.pathway0_res4.branch2.c.weight: (1024, 256, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_1_branch2c_w: (32, 8, 1, 1, 1) => s2.pathway1_res1.branch2.c.weight: (32, 8, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res5_0_branch1_w: (256, 128, 1, 1, 1) => s5.pathway1_res0.branch1.weight: (256, 128, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res2_2_branch2c_bn_riv: (256,) => s2.pathway0_res2.branch2.c_bn.running_var: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_5_branch2c_bn_subsample_w: (256, 128, 7, 1, 1) => s4_fuse.conv_f2s.weight: (256, 128, 7, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_3_branch2b_bn_rm: (256,) => s4.pathway0_res3.branch2.b_bn.running_mean: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res_conv1_bn_rm: (8,) => s1.pathway1_stem.bn.running_mean: (8,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_2_branch2b_bn_riv: (16,) => s3.pathway1_res2.branch2.b_bn.running_var: (16,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_2_branch2c_bn_b: (1024,) => s4.pathway0_res2.branch2.c_bn.bias: (1024,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_3_branch2a_bn_riv: (128,) => s3.pathway0_res3.branch2.a_bn.running_var: (128,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_1_branch2a_w: (16, 64, 3, 1, 1) => s3.pathway1_res1.branch2.a.weight: (16, 64, 3, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res2_2_branch2a_w: (64, 256, 1, 1, 1) => s2.pathway0_res2.branch2.a.weight: (64, 256, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res2_0_branch2c_bn_riv: (256,) => s2.pathway0_res0.branch2.c_bn.running_var: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res2_0_branch2b_bn_riv: (64,) => s2.pathway0_res0.branch2.b_bn.running_var: (64,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res5_2_branch2b_bn_riv: (512,) => s5.pathway0_res2.branch2.b_bn.running_var: (512,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_4_branch2a_bn_rm: (32,) => s4.pathway1_res4.branch2.a_bn.running_mean: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_0_branch2a_bn_s: (32,) => s4.pathway1_res0.branch2.a_bn.weight: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_4_branch2a_bn_s: (32,) => s4.pathway1_res4.branch2.a_bn.weight: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_1_branch2c_w: (64, 16, 1, 1, 1) => s3.pathway1_res1.branch2.c.weight: (64, 16, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_3_branch2b_bn_rm: (32,) => s4.pathway1_res3.branch2.b_bn.running_mean: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_1_branch2a_bn_s: (256,) => s4.pathway0_res1.branch2.a_bn.weight: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res2_1_branch2c_bn_b: (256,) => s2.pathway0_res1.branch2.c_bn.bias: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_2_branch2b_bn_b: (128,) => s3.pathway0_res2.branch2.b_bn.bias: (128,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_1_branch2b_bn_s: (128,) => s3.pathway0_res1.branch2.b_bn.weight: (128,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_1_branch2a_bn_b: (256,) => s4.pathway0_res1.branch2.a_bn.bias: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_1_branch2a_bn_b: (128,) => s3.pathway0_res1.branch2.a_bn.bias: (128,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_1_branch2a_w: (8, 32, 3, 1, 1) => s2.pathway1_res1.branch2.a.weight: (8, 32, 3, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res2_1_branch2c_bn_s: (256,) => s2.pathway0_res1.branch2.c_bn.weight: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_5_branch2b_bn_rm: (32,) => s4.pathway1_res5.branch2.b_bn.running_mean: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res3_2_branch2b_bn_s: (128,) => s3.pathway0_res2.branch2.b_bn.weight: (128,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_2_branch2a_bn_rm: (16,) => s3.pathway1_res2.branch2.a_bn.running_mean: (16,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_1_branch2c_bn_s: (32,) => s2.pathway1_res1.branch2.c_bn.weight: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res5_2_branch2c_bn_riv: (2048,) => s5.pathway0_res2.branch2.c_bn.running_var: (2048,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res5_1_branch2c_bn_s: (2048,) => s5.pathway0_res1.branch2.c_bn.weight: (2048,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_3_branch2a_bn_rm: (32,) => s4.pathway1_res3.branch2.a_bn.running_mean: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res5_1_branch2c_bn_b: (2048,) => s5.pathway0_res1.branch2.c_bn.bias: (2048,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_5_branch2a_bn_riv: (32,) => s4.pathway1_res5.branch2.a_bn.running_var: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res2_1_branch2b_bn_rm: (64,) => s2.pathway0_res1.branch2.b_bn.running_mean: (64,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_3_branch2a_bn_s: (16,) => s3.pathway1_res3.branch2.a_bn.weight: (16,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res5_0_branch2c_bn_b: (2048,) => s5.pathway0_res0.branch2.c_bn.bias: (2048,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_0_branch2c_bn_b: (64,) => s3.pathway1_res0.branch2.c_bn.bias: (64,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_4_branch2b_bn_rm: (256,) => s4.pathway0_res4.branch2.b_bn.running_mean: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res5_1_branch2b_bn_rm: (512,) => s5.pathway0_res1.branch2.b_bn.running_mean: (512,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res5_0_branch2b_w: (512, 512, 1, 3, 3) => s5.pathway0_res0.branch2.b.weight: (512, 512, 1, 3, 3)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_1_branch2b_bn_riv: (8,) => s2.pathway1_res1.branch2.b_bn.running_var: (8,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_0_branch2c_bn_s: (64,) => s3.pathway1_res0.branch2.c_bn.weight: (64,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_1_branch2a_bn_s: (8,) => s2.pathway1_res1.branch2.a_bn.weight: (8,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_4_branch2a_bn_s: (256,) => s4.pathway0_res4.branch2.a_bn.weight: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_2_branch2b_bn_s: (16,) => s3.pathway1_res2.branch2.b_bn.weight: (16,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_0_branch2c_bn_rm: (128,) => s4.pathway1_res0.branch2.c_bn.running_mean: (128,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_2_branch2b_bn_s: (256,) => s4.pathway0_res2.branch2.b_bn.weight: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res2_1_branch2a_bn_b: (8,) => s2.pathway1_res1.branch2.a_bn.bias: (8,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_4_branch2a_bn_b: (256,) => s4.pathway0_res4.branch2.a_bn.bias: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_5_branch2c_bn_subsample_bn_riv: (256,) => s4_fuse.bn.running_var: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_2_branch2b_bn_b: (256,) => s4.pathway0_res2.branch2.b_bn.bias: (256,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res5_2_branch2c_w: (256, 64, 1, 1, 1) => s5.pathway1_res2.branch2.c.weight: (256, 64, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_3_branch2c_bn_rm: (1024,) => s4.pathway0_res3.branch2.c_bn.running_mean: (1024,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_3_branch2a_bn_b: (32,) => s4.pathway1_res3.branch2.a_bn.bias: (32,)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res3_0_branch2c_w: (64, 16, 1, 1, 1) => s3.pathway1_res0.branch2.c.weight: (64, 16, 1, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_2_branch2a_w: (32, 128, 3, 1, 1) => s4.pathway1_res2.branch2.a.weight: (32, 128, 3, 1, 1)
[03/27 20:46:56] pa.utils.checkpoint INFO: res4_5_branch2b_w: (256, 256, 1, 3, 3) => s4.pathway0_res5.branch2.b.weight: (256, 256, 1, 3, 3)
[03/27 20:46:56] pa.utils.checkpoint INFO: t_res4_3_branch2a_bn_s: (32,) => s4.pathway1_res3.branch2.a_bn.weight: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_3_branch2c_bn_riv: (1024,) => s4.pathway0_res3.branch2.c_bn.running_var: (1024,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_2_branch2a_bn_rm: (32,) => s4.pathway1_res2.branch2.a_bn.running_mean: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res3_2_branch2a_bn_rm: (128,) => s3.pathway0_res2.branch2.a_bn.running_mean: (128,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_2_branch2c_bn_rm: (1024,) => s4.pathway0_res2.branch2.c_bn.running_mean: (1024,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_0_branch2c_bn_rm: (1024,) => s4.pathway0_res0.branch2.c_bn.running_mean: (1024,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res5_1_branch2c_bn_riv: (256,) => s5.pathway1_res1.branch2.c_bn.running_var: (256,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_4_branch2c_bn_rm: (128,) => s4.pathway1_res4.branch2.c_bn.running_mean: (128,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res3_0_branch2a_bn_s: (128,) => s3.pathway0_res0.branch2.a_bn.weight: (128,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_0_branch2b_bn_riv: (16,) => s3.pathway1_res0.branch2.b_bn.running_var: (16,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_3_branch2c_bn_subsample_bn_rm: (128,) => s3_fuse.bn.running_mean: (128,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res3_0_branch2a_bn_b: (128,) => s3.pathway0_res0.branch2.a_bn.bias: (128,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_4_branch2a_w: (256, 1024, 3, 1, 1) => s4.pathway0_res4.branch2.a.weight: (256, 1024, 3, 1, 1)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res2_2_branch2c_bn_riv: (32,) => s2.pathway1_res2.branch2.c_bn.running_var: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_3_branch2a_bn_rm: (16,) => s3.pathway1_res3.branch2.a_bn.running_mean: (16,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res2_2_branch2b_w: (8, 8, 1, 3, 3) => s2.pathway1_res2.branch2.b.weight: (8, 8, 1, 3, 3)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res2_0_branch2a_bn_rm: (8,) => s2.pathway1_res0.branch2.a_bn.running_mean: (8,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_0_branch2c_bn_rm: (64,) => s3.pathway1_res0.branch2.c_bn.running_mean: (64,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_5_branch2a_bn_rm: (256,) => s4.pathway0_res5.branch2.a_bn.running_mean: (256,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_4_branch2b_bn_s: (32,) => s4.pathway1_res4.branch2.b_bn.weight: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res5_0_branch2b_bn_riv: (512,) => s5.pathway0_res0.branch2.b_bn.running_var: (512,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_1_branch2b_bn_b: (32,) => s4.pathway1_res1.branch2.b_bn.bias: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res3_1_branch2c_bn_rm: (512,) => s3.pathway0_res1.branch2.c_bn.running_mean: (512,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_4_branch2b_bn_b: (32,) => s4.pathway1_res4.branch2.b_bn.bias: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res3_1_branch2a_bn_s: (128,) => s3.pathway0_res1.branch2.a_bn.weight: (128,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_3_branch2c_bn_b: (64,) => s3.pathway1_res3.branch2.c_bn.bias: (64,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_1_branch2b_bn_s: (32,) => s4.pathway1_res1.branch2.b_bn.weight: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_4_branch2a_bn_b: (32,) => s4.pathway1_res4.branch2.a_bn.bias: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res5_1_branch2a_bn_s: (512,) => s5.pathway0_res1.branch2.a_bn.weight: (512,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_0_branch2a_bn_riv: (256,) => s4.pathway0_res0.branch2.a_bn.running_var: (256,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res5_1_branch2a_bn_b: (512,) => s5.pathway0_res1.branch2.a_bn.bias: (512,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_2_branch2b_bn_riv: (256,) => s4.pathway0_res2.branch2.b_bn.running_var: (256,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res2_2_branch2b_bn_rm: (8,) => s2.pathway1_res2.branch2.b_bn.running_mean: (8,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res3_0_branch2c_w: (512, 128, 1, 1, 1) => s3.pathway0_res0.branch2.c.weight: (512, 128, 1, 1, 1)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_0_branch2a_bn_s: (16,) => s3.pathway1_res0.branch2.a_bn.weight: (16,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_2_branch2c_bn_riv: (128,) => s4.pathway1_res2.branch2.c_bn.running_var: (128,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_0_branch2a_bn_b: (16,) => s3.pathway1_res0.branch2.a_bn.bias: (16,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_1_branch2a_bn_riv: (16,) => s3.pathway1_res1.branch2.a_bn.running_var: (16,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res2_0_branch2b_bn_riv: (8,) => s2.pathway1_res0.branch2.b_bn.running_var: (8,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_1_branch2a_bn_s: (16,) => s3.pathway1_res1.branch2.a_bn.weight: (16,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_5_branch2b_bn_riv: (256,) => s4.pathway0_res5.branch2.b_bn.running_var: (256,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res5_2_branch2b_bn_rm: (512,) => s5.pathway0_res2.branch2.b_bn.running_mean: (512,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_3_branch2c_bn_s: (64,) => s3.pathway1_res3.branch2.c_bn.weight: (64,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res2_0_branch1_bn_riv: (32,) => s2.pathway1_res0.branch1_bn.running_var: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res2_2_branch2b_bn_riv: (64,) => s2.pathway0_res2.branch2.b_bn.running_var: (64,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res2_0_branch1_bn_rm: (256,) => s2.pathway0_res0.branch1_bn.running_mean: (256,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res2_2_branch2b_w: (64, 64, 1, 3, 3) => s2.pathway0_res2.branch2.b.weight: (64, 64, 1, 3, 3)
[03/27 20:46:57] pa.utils.checkpoint INFO: res2_1_branch2a_bn_riv: (64,) => s2.pathway0_res1.branch2.a_bn.running_var: (64,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_2_branch2b_bn_rm: (32,) => s4.pathway1_res2.branch2.b_bn.running_mean: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_2_branch2a_bn_riv: (256,) => s4.pathway0_res2.branch2.a_bn.running_var: (256,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_2_branch2c_bn_s: (1024,) => s4.pathway0_res2.branch2.c_bn.weight: (1024,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res3_3_branch2a_bn_s: (128,) => s3.pathway0_res3.branch2.a_bn.weight: (128,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res2_0_branch2a_bn_riv: (8,) => s2.pathway1_res0.branch2.a_bn.running_var: (8,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res3_0_branch2b_w: (128, 128, 1, 3, 3) => s3.pathway0_res0.branch2.b.weight: (128, 128, 1, 3, 3)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_5_branch2a_bn_riv: (256,) => s4.pathway0_res5.branch2.a_bn.running_var: (256,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_3_branch2b_bn_rm: (16,) => s3.pathway1_res3.branch2.b_bn.running_mean: (16,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res2_0_branch1_bn_s: (32,) => s2.pathway1_res0.branch1_bn.weight: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res4_0_branch2b_bn_riv: (256,) => s4.pathway0_res0.branch2.b_bn.running_var: (256,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_0_branch2b_bn_riv: (32,) => s4.pathway1_res0.branch2.b_bn.running_var: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_0_branch2b_bn_b: (16,) => s3.pathway1_res0.branch2.b_bn.bias: (16,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res2_0_branch1_bn_b: (32,) => s2.pathway1_res0.branch1_bn.bias: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res5_0_branch1_bn_s: (256,) => s5.pathway1_res0.branch1_bn.weight: (256,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res5_1_branch2b_bn_s: (64,) => s5.pathway1_res1.branch2.b_bn.weight: (64,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res3_0_branch1_bn_riv: (64,) => s3.pathway1_res0.branch1_bn.running_var: (64,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res2_2_branch2c_bn_subsample_bn_s: (64,) => s2_fuse.bn.weight: (64,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res5_0_branch2c_w: (2048, 512, 1, 1, 1) => s5.pathway0_res0.branch2.c.weight: (2048, 512, 1, 1, 1)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_5_branch2c_bn_s: (128,) => s4.pathway1_res5.branch2.c_bn.weight: (128,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res5_0_branch2b_bn_rm: (64,) => s5.pathway1_res0.branch2.b_bn.running_mean: (64,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_5_branch2b_bn_s: (32,) => s4.pathway1_res5.branch2.b_bn.weight: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: t_res4_5_branch2b_bn_riv: (32,) => s4.pathway1_res5.branch2.b_bn.running_var: (32,)
[03/27 20:46:57] pa.utils.checkpoint INFO: res5_2_branch2a_w: (512, 2048, 3, 1, 1) => s5.pathway0_res2.branch2.a.weight: (512, 2048, 3, 1, 1)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res5_1_branch2a_w: (64, 256, 3, 1, 1) => s5.pathway1_res1.branch2.a.weight: (64, 256, 3, 1, 1)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_3_branch2b_bn_riv: (32,) => s4.pathway1_res3.branch2.b_bn.running_var: (32,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res2_2_branch2c_bn_subsample_bn_b: (64,) => s2_fuse.bn.bias: (64,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_5_branch2c_bn_b: (128,) => s4.pathway1_res5.branch2.c_bn.bias: (128,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_5_branch2b_bn_b: (32,) => s4.pathway1_res5.branch2.b_bn.bias: (32,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res2_0_branch2c_bn_rm: (256,) => s2.pathway0_res0.branch2.c_bn.running_mean: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res5_0_branch1_w: (2048, 1280, 1, 1, 1) => s5.pathway0_res0.branch1.weight: (2048, 1280, 1, 1, 1)
[03/27 20:46:58] pa.utils.checkpoint INFO: res2_2_branch2c_bn_b: (256,) => s2.pathway0_res2.branch2.c_bn.bias: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_5_branch2c_bn_b: (1024,) => s4.pathway0_res5.branch2.c_bn.bias: (1024,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res2_2_branch2c_bn_subsample_bn_rm: (64,) => s2_fuse.bn.running_mean: (64,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res2_0_branch2c_bn_b: (32,) => s2.pathway1_res0.branch2.c_bn.bias: (32,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_3_branch2b_bn_s: (256,) => s4.pathway0_res3.branch2.b_bn.weight: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res3_2_branch2c_bn_riv: (512,) => s3.pathway0_res2.branch2.c_bn.running_var: (512,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res5_0_branch2a_w: (512, 1280, 3, 1, 1) => s5.pathway0_res0.branch2.a.weight: (512, 1280, 3, 1, 1)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res2_0_branch2c_bn_s: (32,) => s2.pathway1_res0.branch2.c_bn.weight: (32,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res2_1_branch2a_bn_rm: (8,) => s2.pathway1_res1.branch2.a_bn.running_mean: (8,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res2_0_branch2b_bn_rm: (64,) => s2.pathway0_res0.branch2.b_bn.running_mean: (64,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res2_2_branch2a_bn_rm: (64,) => s2.pathway0_res2.branch2.a_bn.running_mean: (64,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res3_0_branch2a_bn_riv: (16,) => s3.pathway1_res0.branch2.a_bn.running_var: (16,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_5_branch2a_bn_rm: (32,) => s4.pathway1_res5.branch2.a_bn.running_mean: (32,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res3_0_branch2b_bn_s: (16,) => s3.pathway1_res0.branch2.b_bn.weight: (16,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res3_2_branch2a_bn_b: (16,) => s3.pathway1_res2.branch2.a_bn.bias: (16,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res5_2_branch2b_bn_b: (64,) => s5.pathway1_res2.branch2.b_bn.bias: (64,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res5_1_branch2a_bn_rm: (512,) => s5.pathway0_res1.branch2.a_bn.running_mean: (512,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_0_branch2a_bn_b: (32,) => s4.pathway1_res0.branch2.a_bn.bias: (32,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res5_2_branch2b_w: (512, 512, 1, 3, 3) => s5.pathway0_res2.branch2.b.weight: (512, 512, 1, 3, 3)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res2_2_branch2a_bn_riv: (8,) => s2.pathway1_res2.branch2.a_bn.running_var: (8,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res2_1_branch2c_bn_riv: (32,) => s2.pathway1_res1.branch2.c_bn.running_var: (32,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res5_2_branch2b_bn_s: (64,) => s5.pathway1_res2.branch2.b_bn.weight: (64,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res5_1_branch2a_bn_rm: (64,) => s5.pathway1_res1.branch2.a_bn.running_mean: (64,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_0_branch2c_bn_b: (1024,) => s4.pathway0_res0.branch2.c_bn.bias: (1024,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_2_branch2a_w: (256, 1024, 3, 1, 1) => s4.pathway0_res2.branch2.a.weight: (256, 1024, 3, 1, 1)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res3_0_branch2a_w: (16, 32, 3, 1, 1) => s3.pathway1_res0.branch2.a.weight: (16, 32, 3, 1, 1)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_0_branch2c_bn_s: (1024,) => s4.pathway0_res0.branch2.c_bn.weight: (1024,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res3_3_branch2b_bn_b: (128,) => s3.pathway0_res3.branch2.b_bn.bias: (128,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res5_1_branch2c_bn_riv: (2048,) => s5.pathway0_res1.branch2.c_bn.running_var: (2048,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res3_0_branch1_w: (64, 32, 1, 1, 1) => s3.pathway1_res0.branch1.weight: (64, 32, 1, 1, 1)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_1_branch2a_bn_riv: (256,) => s4.pathway0_res1.branch2.a_bn.running_var: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: conv1_w: (64, 3, 1, 7, 7) => s1.pathway0_stem.conv.weight: (64, 3, 1, 7, 7)
[03/27 20:46:58] pa.utils.checkpoint INFO: res2_0_branch1_bn_riv: (256,) => s2.pathway0_res0.branch1_bn.running_var: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res3_2_branch2c_bn_rm: (512,) => s3.pathway0_res2.branch2.c_bn.running_mean: (512,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_4_branch2c_bn_s: (1024,) => s4.pathway0_res4.branch2.c_bn.weight: (1024,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_1_branch2c_bn_b: (1024,) => s4.pathway0_res1.branch2.c_bn.bias: (1024,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res5_0_branch2b_bn_rm: (512,) => s5.pathway0_res0.branch2.b_bn.running_mean: (512,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res5_1_branch2c_bn_s: (256,) => s5.pathway1_res1.branch2.c_bn.weight: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res2_1_branch2c_bn_rm: (32,) => s2.pathway1_res1.branch2.c_bn.running_mean: (32,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_4_branch2b_bn_riv: (256,) => s4.pathway0_res4.branch2.b_bn.running_var: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_1_branch2c_bn_s: (1024,) => s4.pathway0_res1.branch2.c_bn.weight: (1024,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_1_branch2c_bn_riv: (1024,) => s4.pathway0_res1.branch2.c_bn.running_var: (1024,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res5_1_branch2c_bn_b: (256,) => s5.pathway1_res1.branch2.c_bn.bias: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_5_branch2b_bn_rm: (256,) => s4.pathway0_res5.branch2.b_bn.running_mean: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_5_branch2a_w: (32, 128, 3, 1, 1) => s4.pathway1_res5.branch2.a.weight: (32, 128, 3, 1, 1)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res5_0_branch2a_bn_b: (64,) => s5.pathway1_res0.branch2.a_bn.bias: (64,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res3_1_branch2b_bn_rm: (128,) => s3.pathway0_res1.branch2.b_bn.running_mean: (128,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_pool1_subsample_bn_riv: (16,) => s1_fuse.bn.running_var: (16,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_1_branch2a_bn_rm: (256,) => s4.pathway0_res1.branch2.a_bn.running_mean: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res5_0_branch2a_bn_s: (64,) => s5.pathway1_res0.branch2.a_bn.weight: (64,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res2_1_branch2c_w: (256, 64, 1, 1, 1) => s2.pathway0_res1.branch2.c.weight: (256, 64, 1, 1, 1)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res3_1_branch2b_bn_s: (16,) => s3.pathway1_res1.branch2.b_bn.weight: (16,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res2_0_branch2a_bn_riv: (64,) => s2.pathway0_res0.branch2.a_bn.running_var: (64,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res5_0_branch2a_bn_rm: (512,) => s5.pathway0_res0.branch2.a_bn.running_mean: (512,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res3_0_branch2c_bn_riv: (512,) => s3.pathway0_res0.branch2.c_bn.running_var: (512,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res3_2_branch2c_bn_s: (512,) => s3.pathway0_res2.branch2.c_bn.weight: (512,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res3_1_branch2b_bn_b: (16,) => s3.pathway1_res1.branch2.b_bn.bias: (16,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res3_2_branch2a_bn_s: (16,) => s3.pathway1_res2.branch2.a_bn.weight: (16,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_4_branch2c_bn_s: (128,) => s4.pathway1_res4.branch2.c_bn.weight: (128,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res2_2_branch2b_bn_riv: (8,) => s2.pathway1_res2.branch2.b_bn.running_var: (8,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res5_0_branch2a_bn_b: (512,) => s5.pathway0_res0.branch2.a_bn.bias: (512,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_4_branch2c_bn_b: (128,) => s4.pathway1_res4.branch2.c_bn.bias: (128,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res3_1_branch2b_bn_b: (128,) => s3.pathway0_res1.branch2.b_bn.bias: (128,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res3_2_branch2a_bn_s: (128,) => s3.pathway0_res2.branch2.a_bn.weight: (128,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_1_branch2a_bn_rm: (32,) => s4.pathway1_res1.branch2.a_bn.running_mean: (32,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res5_0_branch2a_bn_s: (512,) => s5.pathway0_res0.branch2.a_bn.weight: (512,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res5_2_branch2c_bn_b: (256,) => s5.pathway1_res2.branch2.c_bn.bias: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_0_branch2b_bn_rm: (32,) => s4.pathway1_res0.branch2.b_bn.running_mean: (32,)
[03/27 20:46:58] pa.utils.checkpoint INFO: t_res4_0_branch1_w: (128, 64, 1, 1, 1) => s4.pathway1_res0.branch1.weight: (128, 64, 1, 1, 1)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_0_branch2b_bn_s: (256,) => s4.pathway0_res0.branch2.b_bn.weight: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res4_2_branch2a_bn_b: (256,) => s4.pathway0_res2.branch2.a_bn.bias: (256,)
[03/27 20:46:58] pa.utils.checkpoint INFO: res5_2_branch2b_bn_s: (512,) => s5.pathway0_res2.branch2.b_bn.weight: (512,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_3_branch2c_bn_s: (1024,) => s4.pathway0_res3.branch2.c_bn.weight: (1024,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_2_branch2a_bn_s: (256,) => s4.pathway0_res2.branch2.a_bn.weight: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res5_2_branch2b_bn_b: (512,) => s5.pathway0_res2.branch2.b_bn.bias: (512,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res2_0_branch1_bn_rm: (32,) => s2.pathway1_res0.branch1_bn.running_mean: (32,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_1_branch2a_w: (128, 512, 1, 1, 1) => s3.pathway0_res1.branch2.a.weight: (128, 512, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_3_branch2c_bn_b: (1024,) => s4.pathway0_res3.branch2.c_bn.bias: (1024,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_0_branch2b_bn_rm: (128,) => s3.pathway0_res0.branch2.b_bn.running_mean: (128,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res5_0_branch2b_bn_b: (512,) => s5.pathway0_res0.branch2.b_bn.bias: (512,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res5_2_branch2a_bn_b: (512,) => s5.pathway0_res2.branch2.a_bn.bias: (512,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res5_0_branch1_bn_riv: (256,) => s5.pathway1_res0.branch1_bn.running_var: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res3_0_branch1_bn_b: (64,) => s3.pathway1_res0.branch1_bn.bias: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_1_branch2c_bn_b: (128,) => s4.pathway1_res1.branch2.c_bn.bias: (128,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_3_branch2a_bn_b: (128,) => s3.pathway0_res3.branch2.a_bn.bias: (128,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_0_branch2c_w: (128, 32, 1, 1, 1) => s4.pathway1_res0.branch2.c.weight: (128, 32, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: res5_2_branch2a_bn_s: (512,) => s5.pathway0_res2.branch2.a_bn.weight: (512,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res3_0_branch1_bn_s: (64,) => s3.pathway1_res0.branch1_bn.weight: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_1_branch2c_bn_s: (128,) => s4.pathway1_res1.branch2.c_bn.weight: (128,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res2_0_branch1_bn_b: (256,) => s2.pathway0_res0.branch1_bn.bias: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_1_branch2b_bn_riv: (256,) => s4.pathway0_res1.branch2.b_bn.running_var: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_0_branch2b_bn_rm: (256,) => s4.pathway0_res0.branch2.b_bn.running_mean: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_0_branch1_bn_s: (1024,) => s4.pathway0_res0.branch1_bn.weight: (1024,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res3_2_branch2c_w: (64, 16, 1, 1, 1) => s3.pathway1_res2.branch2.c.weight: (64, 16, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_2_branch2a_bn_rm: (256,) => s4.pathway0_res2.branch2.a_bn.running_mean: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_0_branch1_bn_b: (128,) => s4.pathway1_res0.branch1_bn.bias: (128,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res2_0_branch2b_w: (64, 64, 1, 3, 3) => s2.pathway0_res0.branch2.b.weight: (64, 64, 1, 3, 3)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res5_0_branch2c_w: (256, 64, 1, 1, 1) => s5.pathway1_res0.branch2.c.weight: (256, 64, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_0_branch1_bn_b: (1024,) => s4.pathway0_res0.branch1_bn.bias: (1024,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_1_branch2c_bn_rm: (1024,) => s4.pathway0_res1.branch2.c_bn.running_mean: (1024,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_2_branch2b_bn_riv: (32,) => s4.pathway1_res2.branch2.b_bn.running_var: (32,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_3_branch2a_bn_rm: (256,) => s4.pathway0_res3.branch2.a_bn.running_mean: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res5_0_branch2b_bn_b: (64,) => s5.pathway1_res0.branch2.b_bn.bias: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_1_branch2a_w: (32, 128, 3, 1, 1) => s4.pathway1_res1.branch2.a.weight: (32, 128, 3, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_2_branch2b_w: (256, 256, 1, 3, 3) => s4.pathway0_res2.branch2.b.weight: (256, 256, 1, 3, 3)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_1_branch2c_w: (512, 128, 1, 1, 1) => s3.pathway0_res1.branch2.c.weight: (512, 128, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_0_branch1_bn_s: (512,) => s3.pathway0_res0.branch1_bn.weight: (512,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res3_1_branch2b_w: (16, 16, 1, 3, 3) => s3.pathway1_res1.branch2.b.weight: (16, 16, 1, 3, 3)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res5_0_branch2a_w: (64, 128, 3, 1, 1) => s5.pathway1_res0.branch2.a.weight: (64, 128, 3, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res5_2_branch2a_bn_rm: (64,) => s5.pathway1_res2.branch2.a_bn.running_mean: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_1_branch2c_w: (128, 32, 1, 1, 1) => s4.pathway1_res1.branch2.c.weight: (128, 32, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res3_2_branch2c_bn_s: (64,) => s3.pathway1_res2.branch2.c_bn.weight: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_1_branch2b_bn_rm: (256,) => s4.pathway0_res1.branch2.b_bn.running_mean: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_3_branch2b_bn_rm: (128,) => s3.pathway0_res3.branch2.b_bn.running_mean: (128,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_0_branch2b_bn_b: (256,) => s4.pathway0_res0.branch2.b_bn.bias: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res3_2_branch2c_bn_b: (64,) => s3.pathway1_res2.branch2.c_bn.bias: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res5_1_branch2a_bn_riv: (64,) => s5.pathway1_res1.branch2.a_bn.running_var: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_1_branch2c_bn_riv: (512,) => s3.pathway0_res1.branch2.c_bn.running_var: (512,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_0_branch2c_bn_b: (512,) => s3.pathway0_res0.branch2.c_bn.bias: (512,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_0_branch2a_bn_rm: (128,) => s3.pathway0_res0.branch2.a_bn.running_mean: (128,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res5_0_branch2c_bn_s: (2048,) => s5.pathway0_res0.branch2.c_bn.weight: (2048,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res5_1_branch2a_bn_b: (64,) => s5.pathway1_res1.branch2.a_bn.bias: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_2_branch2b_bn_rm: (256,) => s4.pathway0_res2.branch2.b_bn.running_mean: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res5_2_branch2c_w: (2048, 512, 1, 1, 1) => s5.pathway0_res2.branch2.c.weight: (2048, 512, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_0_branch2c_bn_s: (512,) => s3.pathway0_res0.branch2.c_bn.weight: (512,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res2_0_branch2a_w: (64, 80, 1, 1, 1) => s2.pathway0_res0.branch2.a.weight: (64, 80, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res5_1_branch2a_bn_s: (64,) => s5.pathway1_res1.branch2.a_bn.weight: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res4_3_branch2a_bn_riv: (256,) => s4.pathway0_res3.branch2.a_bn.running_var: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res3_1_branch2c_bn_s: (64,) => s3.pathway1_res1.branch2.c_bn.weight: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res5_2_branch2c_bn_s: (256,) => s5.pathway1_res2.branch2.c_bn.weight: (256,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_4_branch2a_w: (32, 128, 3, 1, 1) => s4.pathway1_res4.branch2.a.weight: (32, 128, 3, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res3_1_branch2c_bn_b: (64,) => s3.pathway1_res1.branch2.c_bn.bias: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_3_branch2c_w: (128, 32, 1, 1, 1) => s4.pathway1_res3.branch2.c.weight: (128, 32, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_0_branch2b_w: (32, 32, 1, 3, 3) => s4.pathway1_res0.branch2.b.weight: (32, 32, 1, 3, 3)
[03/27 20:46:59] pa.utils.checkpoint INFO: res2_2_branch2c_w: (256, 64, 1, 1, 1) => s2.pathway0_res2.branch2.c.weight: (256, 64, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res5_2_branch2a_bn_riv: (64,) => s5.pathway1_res2.branch2.a_bn.running_var: (64,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_0_branch2b_bn_b: (32,) => s4.pathway1_res0.branch2.b_bn.bias: (32,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_0_branch2c_bn_riv: (128,) => s4.pathway1_res0.branch2.c_bn.running_var: (128,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_0_branch2b_bn_s: (32,) => s4.pathway1_res0.branch2.b_bn.weight: (32,)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res3_0_branch2b_w: (16, 16, 1, 3, 3) => s3.pathway1_res0.branch2.b.weight: (16, 16, 1, 3, 3)
[03/27 20:46:59] pa.utils.checkpoint INFO: t_res4_3_branch2a_w: (32, 128, 3, 1, 1) => s4.pathway1_res3.branch2.a.weight: (32, 128, 3, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_0_branch2b_bn_riv: (128,) => s3.pathway0_res0.branch2.b_bn.running_var: (128,)
[03/27 20:46:59] pa.utils.checkpoint INFO: res3_0_branch2a_w: (128, 320, 1, 1, 1) => s3.pathway0_res0.branch2.a.weight: (128, 320, 1, 1, 1)
[03/27 20:46:59] pa.utils.checkpoint INFO: res2_0_branch2a_bn_s: (64,) => s2.pathway0_res0.branch2.a_bn.weight: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res3_1_branch2c_bn_rm: (64,) => s3.pathway1_res1.branch2.c_bn.running_mean: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res_conv1_bn_b: (8,) => s1.pathway1_stem.bn.bias: (8,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res4_3_branch2a_bn_b: (256,) => s4.pathway0_res3.branch2.a_bn.bias: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_1_branch2a_bn_s: (64,) => s2.pathway0_res1.branch2.a_bn.weight: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res3_0_branch2c_bn_rm: (512,) => s3.pathway0_res0.branch2.c_bn.running_mean: (512,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res2_1_branch2b_bn_s: (8,) => s2.pathway1_res1.branch2.b_bn.weight: (8,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res3_2_branch2c_bn_b: (512,) => s3.pathway0_res2.branch2.c_bn.bias: (512,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_0_branch2a_bn_b: (64,) => s2.pathway0_res0.branch2.a_bn.bias: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res4_1_branch2b_bn_s: (256,) => s4.pathway0_res1.branch2.b_bn.weight: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res3_2_branch2b_w: (16, 16, 1, 3, 3) => s3.pathway1_res2.branch2.b.weight: (16, 16, 1, 3, 3)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res_conv1_bn_s: (8,) => s1.pathway1_stem.bn.weight: (8,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res2_0_branch2b_bn_rm: (8,) => s2.pathway1_res0.branch2.b_bn.running_mean: (8,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_1_branch2a_bn_b: (64,) => s2.pathway0_res1.branch2.a_bn.bias: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res4_2_branch2c_w: (1024, 256, 1, 1, 1) => s4.pathway0_res2.branch2.c.weight: (1024, 256, 1, 1, 1)
[03/27 20:47:00] pa.utils.checkpoint INFO: res3_1_branch2b_w: (128, 128, 1, 3, 3) => s3.pathway0_res1.branch2.b.weight: (128, 128, 1, 3, 3)
[03/27 20:47:00] pa.utils.checkpoint INFO: res3_1_branch2a_bn_rm: (128,) => s3.pathway0_res1.branch2.a_bn.running_mean: (128,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res3_3_branch2c_w: (512, 128, 1, 1, 1) => s3.pathway0_res3.branch2.c.weight: (512, 128, 1, 1, 1)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_1_branch2a_bn_riv: (32,) => s4.pathway1_res1.branch2.a_bn.running_var: (32,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_4_branch2b_w: (32, 32, 1, 3, 3) => s4.pathway1_res4.branch2.b.weight: (32, 32, 1, 3, 3)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_0_branch2a_bn_rm: (64,) => s2.pathway0_res0.branch2.a_bn.running_mean: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_0_branch2b_bn_b: (64,) => s2.pathway0_res0.branch2.b_bn.bias: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res4_0_branch2a_bn_s: (256,) => s4.pathway0_res0.branch2.a_bn.weight: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res3_3_branch2a_w: (128, 512, 1, 1, 1) => s3.pathway0_res3.branch2.a.weight: (128, 512, 1, 1, 1)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_4_branch2c_w: (128, 32, 1, 1, 1) => s4.pathway1_res4.branch2.c.weight: (128, 32, 1, 1, 1)
[03/27 20:47:00] pa.utils.checkpoint INFO: res3_1_branch2b_bn_riv: (128,) => s3.pathway0_res1.branch2.b_bn.running_var: (128,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_0_branch2b_bn_s: (64,) => s2.pathway0_res0.branch2.b_bn.weight: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res4_0_branch2a_bn_b: (256,) => s4.pathway0_res0.branch2.a_bn.bias: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res5_0_branch2c_bn_s: (256,) => s5.pathway1_res0.branch2.c_bn.weight: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_5_branch2c_bn_rm: (128,) => s4.pathway1_res5.branch2.c_bn.running_mean: (128,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res3_3_branch2c_bn_subsample_w: (128, 64, 7, 1, 1) => s3_fuse.conv_f2s.weight: (128, 64, 7, 1, 1)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_0_branch2c_bn_b: (256,) => s2.pathway0_res0.branch2.c_bn.bias: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res2_2_branch2c_bn_rm: (32,) => s2.pathway1_res2.branch2.c_bn.running_mean: (32,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res5_0_branch2b_bn_s: (512,) => s5.pathway0_res0.branch2.b_bn.weight: (512,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res5_0_branch2a_bn_rm: (64,) => s5.pathway1_res0.branch2.a_bn.running_mean: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res5_2_branch2c_bn_rm: (2048,) => s5.pathway0_res2.branch2.c_bn.running_mean: (2048,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_0_branch2c_bn_s: (256,) => s2.pathway0_res0.branch2.c_bn.weight: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res4_3_branch2b_w: (256, 256, 1, 3, 3) => s4.pathway0_res3.branch2.b.weight: (256, 256, 1, 3, 3)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res5_0_branch2a_bn_riv: (64,) => s5.pathway1_res0.branch2.a_bn.running_var: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res3_0_branch2b_bn_b: (128,) => s3.pathway0_res0.branch2.b_bn.bias: (128,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res3_1_branch2c_bn_riv: (64,) => s3.pathway1_res1.branch2.c_bn.running_var: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res4_1_branch2b_bn_b: (256,) => s4.pathway0_res1.branch2.b_bn.bias: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res4_0_branch2a_bn_rm: (256,) => s4.pathway0_res0.branch2.a_bn.running_mean: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_3_branch2c_bn_s: (128,) => s4.pathway1_res3.branch2.c_bn.weight: (128,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_0_branch1_w: (256, 80, 1, 1, 1) => s2.pathway0_res0.branch1.weight: (256, 80, 1, 1, 1)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res5_0_branch2c_bn_b: (256,) => s5.pathway1_res0.branch2.c_bn.bias: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_1_branch2c_bn_rm: (256,) => s2.pathway0_res1.branch2.c_bn.running_mean: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res3_0_branch2b_bn_s: (128,) => s3.pathway0_res0.branch2.b_bn.weight: (128,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res3_3_branch2b_w: (16, 16, 1, 3, 3) => s3.pathway1_res3.branch2.b.weight: (16, 16, 1, 3, 3)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_0_branch2a_bn_rm: (32,) => s4.pathway1_res0.branch2.a_bn.running_mean: (32,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res3_3_branch2c_bn_rm: (64,) => s3.pathway1_res3.branch2.c_bn.running_mean: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_3_branch2c_bn_b: (128,) => s4.pathway1_res3.branch2.c_bn.bias: (128,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_2_branch2a_bn_s: (32,) => s4.pathway1_res2.branch2.a_bn.weight: (32,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res4_5_branch2c_bn_rm: (1024,) => s4.pathway0_res5.branch2.c_bn.running_mean: (1024,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_2_branch2a_bn_b: (32,) => s4.pathway1_res2.branch2.a_bn.bias: (32,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res_conv1_bn_riv: (64,) => s1.pathway0_stem.bn.running_var: (64,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res5_0_branch1_bn_b: (256,) => s5.pathway1_res0.branch1_bn.bias: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_4_branch2c_bn_riv: (128,) => s4.pathway1_res4.branch2.c_bn.running_var: (128,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_0_branch2a_w: (32, 64, 3, 1, 1) => s4.pathway1_res0.branch2.a.weight: (32, 64, 3, 1, 1)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_5_branch2c_bn_riv: (128,) => s4.pathway1_res5.branch2.c_bn.running_var: (128,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res4_4_branch2a_bn_riv: (256,) => s4.pathway0_res4.branch2.a_bn.running_var: (256,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res2_1_branch2a_bn_riv: (8,) => s2.pathway1_res1.branch2.a_bn.running_var: (8,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res5_1_branch2c_w: (256, 64, 1, 1, 1) => s5.pathway1_res1.branch2.c.weight: (256, 64, 1, 1, 1)
[03/27 20:47:00] pa.utils.checkpoint INFO: res5_1_branch2b_bn_riv: (512,) => s5.pathway0_res1.branch2.b_bn.running_var: (512,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res2_0_branch2c_bn_rm: (32,) => s2.pathway1_res0.branch2.c_bn.running_mean: (32,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res3_3_branch2c_bn_subsample_bn_riv: (128,) => s3_fuse.bn.running_var: (128,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res5_2_branch2c_bn_s: (2048,) => s5.pathway0_res2.branch2.c_bn.weight: (2048,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res3_3_branch2a_bn_b: (16,) => s3.pathway1_res3.branch2.a_bn.bias: (16,)
[03/27 20:47:00] pa.utils.checkpoint INFO: res2_0_branch2c_w: (256, 64, 1, 1, 1) => s2.pathway0_res0.branch2.c.weight: (256, 64, 1, 1, 1)
[03/27 20:47:00] pa.utils.checkpoint INFO: res5_2_branch2c_bn_b: (2048,) => s5.pathway0_res2.branch2.c_bn.bias: (2048,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res4_5_branch2a_bn_b: (32,) => s4.pathway1_res5.branch2.a_bn.bias: (32,)
[03/27 20:47:00] pa.utils.checkpoint INFO: t_res5_0_branch2b_w: (64, 64, 1, 3, 3) => s5.pathway1_res0.branch2.b.weight: (64, 64, 1, 3, 3)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_5_branch2c_w: (128, 32, 1, 1, 1) => s4.pathway1_res5.branch2.c.weight: (128, 32, 1, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res2_2_branch2a_bn_s: (8,) => s2.pathway1_res2.branch2.a_bn.weight: (8,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_3_branch2a_bn_s: (256,) => s4.pathway0_res3.branch2.a_bn.weight: (256,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_3_branch2b_bn_riv: (256,) => s4.pathway0_res3.branch2.b_bn.running_var: (256,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_2_branch2c_bn_riv: (1024,) => s4.pathway0_res2.branch2.c_bn.running_var: (1024,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_1_branch2b_w: (256, 256, 1, 3, 3) => s4.pathway0_res1.branch2.b.weight: (256, 256, 1, 3, 3)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res5_2_branch2b_bn_rm: (64,) => s5.pathway1_res2.branch2.b_bn.running_mean: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res2_1_branch2a_w: (64, 256, 1, 1, 1) => s2.pathway0_res1.branch2.a.weight: (64, 256, 1, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_0_branch1_w: (1024, 640, 1, 1, 1) => s4.pathway0_res0.branch1.weight: (1024, 640, 1, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_2_branch2b_bn_b: (32,) => s4.pathway1_res2.branch2.b_bn.bias: (32,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_2_branch2b_w: (32, 32, 1, 3, 3) => s4.pathway1_res2.branch2.b.weight: (32, 32, 1, 3, 3)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res_conv1_bn_riv: (8,) => s1.pathway1_stem.bn.running_var: (8,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res3_3_branch2b_bn_s: (16,) => s3.pathway1_res3.branch2.b_bn.weight: (16,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_2_branch2b_bn_s: (32,) => s4.pathway1_res2.branch2.b_bn.weight: (32,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res3_3_branch2c_bn_rm: (512,) => s3.pathway0_res3.branch2.c_bn.running_mean: (512,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_0_branch2a_bn_riv: (32,) => s4.pathway1_res0.branch2.a_bn.running_var: (32,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res3_2_branch2b_bn_riv: (128,) => s3.pathway0_res2.branch2.b_bn.running_var: (128,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res3_3_branch2b_bn_b: (16,) => s3.pathway1_res3.branch2.b_bn.bias: (16,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res2_0_branch2b_w: (8, 8, 1, 3, 3) => s2.pathway1_res0.branch2.b.weight: (8, 8, 1, 3, 3)
[03/27 20:47:01] pa.utils.checkpoint INFO: res3_3_branch2a_bn_rm: (128,) => s3.pathway0_res3.branch2.a_bn.running_mean: (128,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res5_0_branch1_bn_riv: (2048,) => s5.pathway0_res0.branch1_bn.running_var: (2048,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_4_branch2c_bn_riv: (1024,) => s4.pathway0_res4.branch2.c_bn.running_var: (1024,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_1_branch2b_bn_rm: (32,) => s4.pathway1_res1.branch2.b_bn.running_mean: (32,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res2_2_branch2b_bn_s: (64,) => s2.pathway0_res2.branch2.b_bn.weight: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res_conv1_bn_rm: (64,) => s1.pathway0_stem.bn.running_mean: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res3_2_branch2b_bn_rm: (128,) => s3.pathway0_res2.branch2.b_bn.running_mean: (128,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res2_2_branch2b_bn_b: (64,) => s2.pathway0_res2.branch2.b_bn.bias: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res2_2_branch2c_bn_s: (256,) => s2.pathway0_res2.branch2.c_bn.weight: (256,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_1_branch2c_bn_rm: (128,) => s4.pathway1_res1.branch2.c_bn.running_mean: (128,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_conv1_w: (8, 3, 5, 7, 7) => s1.pathway1_stem.conv.weight: (8, 3, 5, 7, 7)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_0_branch1_bn_rm: (1024,) => s4.pathway0_res0.branch1_bn.running_mean: (1024,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_0_branch1_bn_riv: (1024,) => s4.pathway0_res0.branch1_bn.running_var: (1024,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res5_0_branch1_bn_s: (2048,) => s5.pathway0_res0.branch1_bn.weight: (2048,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_0_branch2c_w: (1024, 256, 1, 1, 1) => s4.pathway0_res0.branch2.c.weight: (1024, 256, 1, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: res2_1_branch2b_bn_b: (64,) => s2.pathway0_res1.branch2.b_bn.bias: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res5_0_branch2c_bn_rm: (2048,) => s5.pathway0_res0.branch2.c_bn.running_mean: (2048,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res5_0_branch1_bn_b: (2048,) => s5.pathway0_res0.branch1_bn.bias: (2048,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_0_branch2a_w: (256, 640, 3, 1, 1) => s4.pathway0_res0.branch2.a.weight: (256, 640, 3, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_4_branch2c_bn_rm: (1024,) => s4.pathway0_res4.branch2.c_bn.running_mean: (1024,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res2_1_branch2b_bn_s: (64,) => s2.pathway0_res1.branch2.b_bn.weight: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res5_0_branch2c_bn_rm: (256,) => s5.pathway1_res0.branch2.c_bn.running_mean: (256,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res3_1_branch2a_bn_rm: (16,) => s3.pathway1_res1.branch2.a_bn.running_mean: (16,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_3_branch2b_w: (32, 32, 1, 3, 3) => s4.pathway1_res3.branch2.b.weight: (32, 32, 1, 3, 3)
[03/27 20:47:01] pa.utils.checkpoint INFO: res3_0_branch1_bn_riv: (512,) => s3.pathway0_res0.branch1_bn.running_var: (512,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res2_2_branch2a_bn_b: (8,) => s2.pathway1_res2.branch2.a_bn.bias: (8,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_3_branch2c_w: (1024, 256, 1, 1, 1) => s4.pathway0_res3.branch2.c.weight: (1024, 256, 1, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res5_0_branch2b_bn_s: (64,) => s5.pathway1_res0.branch2.b_bn.weight: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res5_1_branch2b_bn_rm: (64,) => s5.pathway1_res1.branch2.b_bn.running_mean: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_3_branch2b_bn_s: (32,) => s4.pathway1_res3.branch2.b_bn.weight: (32,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_pool1_subsample_bn_s: (16,) => s1_fuse.bn.weight: (16,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res2_2_branch2b_bn_s: (8,) => s2.pathway1_res2.branch2.b_bn.weight: (8,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res2_1_branch2b_w: (64, 64, 1, 3, 3) => s2.pathway0_res1.branch2.b.weight: (64, 64, 1, 3, 3)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res5_1_branch2c_bn_rm: (256,) => s5.pathway1_res1.branch2.c_bn.running_mean: (256,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res3_2_branch2a_bn_riv: (128,) => s3.pathway0_res2.branch2.a_bn.running_var: (128,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_3_branch2b_bn_b: (32,) => s4.pathway1_res3.branch2.b_bn.bias: (32,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_pool1_subsample_bn_b: (16,) => s1_fuse.bn.bias: (16,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_0_branch1_bn_s: (128,) => s4.pathway1_res0.branch1_bn.weight: (128,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_4_branch2b_bn_riv: (32,) => s4.pathway1_res4.branch2.b_bn.running_var: (32,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res3_0_branch1_bn_rm: (64,) => s3.pathway1_res0.branch1_bn.running_mean: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res3_0_branch2a_bn_rm: (16,) => s3.pathway1_res0.branch2.a_bn.running_mean: (16,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_3_branch2c_bn_riv: (128,) => s4.pathway1_res3.branch2.c_bn.running_var: (128,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res2_2_branch2c_bn_rm: (256,) => s2.pathway0_res2.branch2.c_bn.running_mean: (256,)
[03/27 20:47:01] pa.utils.checkpoint INFO: !! pred_b: (400,) does not match head.projection.bias: (80,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res3_3_branch2b_w: (128, 128, 1, 3, 3) => s3.pathway0_res3.branch2.b.weight: (128, 128, 1, 3, 3)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res3_2_branch2a_w: (16, 64, 3, 1, 1) => s3.pathway1_res2.branch2.a.weight: (16, 64, 3, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_2_branch2c_bn_s: (128,) => s4.pathway1_res2.branch2.c_bn.weight: (128,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res2_1_branch2b_bn_riv: (64,) => s2.pathway0_res1.branch2.b_bn.running_var: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: !! pred_w: (400, 2304) does not match head.projection.weight: (80, 2304)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_3_branch2a_w: (256, 1024, 3, 1, 1) => s4.pathway0_res3.branch2.a.weight: (256, 1024, 3, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res2_2_branch2c_bn_subsample_bn_riv: (64,) => s2_fuse.bn.running_var: (64,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res3_3_branch2a_w: (16, 64, 3, 1, 1) => s3.pathway1_res3.branch2.a.weight: (16, 64, 3, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res3_1_branch2b_bn_riv: (16,) => s3.pathway1_res1.branch2.b_bn.running_var: (16,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_1_branch2a_w: (256, 1024, 3, 1, 1) => s4.pathway0_res1.branch2.a.weight: (256, 1024, 3, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res2_0_branch1_w: (32, 8, 1, 1, 1) => s2.pathway1_res0.branch1.weight: (32, 8, 1, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_5_branch2c_bn_s: (1024,) => s4.pathway0_res5.branch2.c_bn.weight: (1024,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res4_2_branch2c_bn_b: (128,) => s4.pathway1_res2.branch2.c_bn.bias: (128,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res2_0_branch2a_bn_s: (8,) => s2.pathway1_res0.branch2.a_bn.weight: (8,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res2_2_branch2c_bn_subsample_w: (64, 32, 7, 1, 1) => s2_fuse.conv_f2s.weight: (64, 32, 7, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_5_branch2a_bn_s: (256,) => s4.pathway0_res5.branch2.a_bn.weight: (256,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_3_branch2b_bn_b: (256,) => s4.pathway0_res3.branch2.b_bn.bias: (256,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res5_2_branch2a_w: (64, 256, 3, 1, 1) => s5.pathway1_res2.branch2.a.weight: (64, 256, 3, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_pool1_subsample_w: (16, 8, 7, 1, 1) => s1_fuse.conv_f2s.weight: (16, 8, 7, 1, 1)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res2_0_branch2a_bn_b: (8,) => s2.pathway1_res0.branch2.a_bn.bias: (8,)
[03/27 20:47:01] pa.utils.checkpoint INFO: res4_5_branch2a_bn_b: (256,) => s4.pathway0_res5.branch2.a_bn.bias: (256,)
[03/27 20:47:01] pa.utils.checkpoint INFO: t_res3_3_branch2c_w: (64, 16, 1, 1, 1) => s3.pathway1_res3.branch2.c.weight: (64, 16, 1, 1, 1)
[03/27 20:47:20] pa.datasets.ava_helper INFO: Finished loading image paths from: /public/sist/home/hexm/Projects/pyaction/data/ava/frame_lists/train.csv
[03/27 20:47:39] pa.datasets.ava_helper INFO: Finished loading annotations from: /public/sist/home/hexm/Projects/pyaction/data/ava/annotations/ava_train_v2.2.csv, /public/sist/home/hexm/Projects/pyaction/data/ava/annotations/ava_train_v2.2.csv, /public/sist/home/hexm/Projects/pyaction/data/ava/annotations/person_box_67091280_iou90/ava_detection_train_boxes_and_labels_include_negative_v2.2.csv
[03/27 20:47:39] pa.datasets.ava_helper INFO: Detection threshold: 0.8
[03/27 20:47:39] pa.datasets.ava_helper INFO: Number of unique boxes: 690084
[03/27 20:47:39] pa.datasets.ava_helper INFO: Number of annotations: 2532665
[03/27 20:47:40] pa.datasets.ava_helper INFO: 195528 keyframes used.
[03/27 20:47:40] pa.datasets.ava_dataset INFO: === AVA dataset summary ===
[03/27 20:47:40] pa.datasets.ava_dataset INFO: Split: train
[03/27 20:47:40] pa.datasets.ava_dataset INFO: Number of videos: 235
[03/27 20:47:40] pa.datasets.ava_dataset INFO: Number of frames: 6352104
[03/27 20:47:40] pa.datasets.ava_dataset INFO: Number of key frames: 195528
[03/27 20:47:40] pa.datasets.ava_dataset INFO: Number of boxes: 690084.
[03/27 20:47:44] pa.datasets.ava_helper INFO: Finished loading image paths from: /public/sist/home/hexm/Projects/pyaction/data/ava/frame_lists/val.csv
[03/27 20:47:45] pa.datasets.ava_helper INFO: Finished loading annotations from: /public/sist/home/hexm/Projects/pyaction/data/ava/annotations/person_box_67091280_iou90/ava_detection_val_boxes_and_labels.csv
[03/27 20:47:45] pa.datasets.ava_helper INFO: Detection threshold: 0.8
[03/27 20:47:45] pa.datasets.ava_helper INFO: Number of unique boxes: 24863
[03/27 20:47:45] pa.datasets.ava_helper INFO: Number of annotations: 0
[03/27 20:47:45] pa.datasets.ava_helper INFO: 13169 keyframes used.
[03/27 20:47:45] pa.datasets.ava_dataset INFO: === AVA dataset summary ===
[03/27 20:47:45] pa.datasets.ava_dataset INFO: Split: val
[03/27 20:47:45] pa.datasets.ava_dataset INFO: Number of videos: 64
[03/27 20:47:45] pa.datasets.ava_dataset INFO: Number of frames: 1729931
[03/27 20:47:45] pa.datasets.ava_dataset INFO: Number of key frames: 13169
[03/27 20:47:45] pa.datasets.ava_dataset INFO: Number of boxes: 24863.
[03/27 20:48:08] pa.datasets.ava_helper INFO: Finished loading image paths from: /public/sist/home/hexm/Projects/pyaction/data/ava/frame_lists/train.csv
[03/27 20:48:16] pa.datasets.ava_helper INFO: Finished loading image paths from: /public/sist/home/hexm/Projects/pyaction/data/ava/frame_lists/val.csv
[03/27 20:48:16] pyaction INFO: Start epoch: 1
[03/27 20:59:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "10", "eta": "2:30:25", "loss": 0.796501, "lr": 0.000140, "mode": "train", "time_backward": 1.053365, "time_data": 0.016666, "time_diff": 1.479310, "time_forward": 0.397781, "time_loss": 0.000255}
[03/27 21:01:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "20", "eta": "2:29:56", "loss": 0.747410, "lr": 0.000156, "mode": "train", "time_backward": 1.052257, "time_data": 0.016767, "time_diff": 1.474750, "time_forward": 0.397484, "time_loss": 0.000427}
[03/27 21:01:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "30", "eta": "2:32:08", "loss": 0.674784, "lr": 0.000172, "mode": "train", "time_backward": 1.071878, "time_data": 0.018006, "time_diff": 1.549273, "time_forward": 0.451809, "time_loss": 0.001739}
[03/27 21:02:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "40", "eta": "2:31:17", "loss": 0.601069, "lr": 0.000189, "mode": "train", "time_backward": 1.055730, "time_data": 0.016559, "time_diff": 1.477719, "time_forward": 0.397241, "time_loss": 0.000219}
[03/27 21:04:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "50", "eta": "2:30:33", "loss": 0.535168, "lr": 0.000205, "mode": "train", "time_backward": 1.051377, "time_data": 0.017000, "time_diff": 1.470868, "time_forward": 0.398615, "time_loss": 0.000410}
[03/27 21:05:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "60", "eta": "2:30:01", "loss": 0.475768, "lr": 0.000221, "mode": "train", "time_backward": 1.052909, "time_data": 0.017185, "time_diff": 1.473610, "time_forward": 0.398260, "time_loss": 0.000353}
[03/27 21:07:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "70", "eta": "2:29:36", "loss": 0.424088, "lr": 0.000238, "mode": "train", "time_backward": 1.054096, "time_data": 0.016826, "time_diff": 1.476302, "time_forward": 0.398277, "time_loss": 0.000359}
[03/27 21:07:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "80", "eta": "2:29:10", "loss": 0.384106, "lr": 0.000254, "mode": "train", "time_backward": 1.050626, "time_data": 0.017274, "time_diff": 1.470264, "time_forward": 0.395730, "time_loss": 0.000213}
[03/27 21:09:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "90", "eta": "2:28:52", "loss": 0.349996, "lr": 0.000270, "mode": "train", "time_backward": 1.057339, "time_data": 0.017229, "time_diff": 1.479886, "time_forward": 0.397781, "time_loss": 0.000240}
[03/27 21:10:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "100", "eta": "2:28:31", "loss": 0.334040, "lr": 0.000287, "mode": "train", "time_backward": 1.052148, "time_data": 0.016860, "time_diff": 1.472812, "time_forward": 0.397293, "time_loss": 0.000327}
[03/27 21:11:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "110", "eta": "2:28:12", "loss": 0.295782, "lr": 0.000303, "mode": "train", "time_backward": 1.052467, "time_data": 0.017784, "time_diff": 1.474913, "time_forward": 0.397803, "time_loss": 0.000425}
[03/27 21:13:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "120", "eta": "2:27:55", "loss": 0.290701, "lr": 0.000319, "mode": "train", "time_backward": 1.053718, "time_data": 0.017917, "time_diff": 1.478532, "time_forward": 0.397030, "time_loss": 0.000271}
[03/27 21:15:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "130", "eta": "2:27:50", "loss": 0.266116, "lr": 0.000336, "mode": "train", "time_backward": 1.054458, "time_data": 0.017434, "time_diff": 1.501917, "time_forward": 0.426346, "time_loss": 0.000475}
[03/27 21:15:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "140", "eta": "2:27:31", "loss": 0.248174, "lr": 0.000352, "mode": "train", "time_backward": 1.052329, "time_data": 0.018224, "time_diff": 1.473923, "time_forward": 0.397865, "time_loss": 0.000248}
[03/27 21:16:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "150", "eta": "2:27:14", "loss": 0.237931, "lr": 0.000368, "mode": "train", "time_backward": 1.060402, "time_data": 0.016767, "time_diff": 1.475962, "time_forward": 0.396182, "time_loss": 0.000279}
[03/27 21:17:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "160", "eta": "2:26:55", "loss": 0.249510, "lr": 0.000385, "mode": "train", "time_backward": 1.054352, "time_data": 0.016638, "time_diff": 1.471985, "time_forward": 0.397408, "time_loss": 0.000229}
[03/27 21:18:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "170", "eta": "2:28:54", "loss": 0.222165, "lr": 0.000401, "mode": "train", "time_backward": 1.443239, "time_data": 0.017308, "time_diff": 1.864279, "time_forward": 0.396350, "time_loss": 0.000241}
[03/27 21:19:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "180", "eta": "2:28:30", "loss": 0.213863, "lr": 0.000417, "mode": "train", "time_backward": 1.056159, "time_data": 0.016916, "time_diff": 1.476219, "time_forward": 0.397133, "time_loss": 0.000204}
[03/27 21:19:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "190", "eta": "2:28:06", "loss": 0.211041, "lr": 0.000434, "mode": "train", "time_backward": 1.051362, "time_data": 0.016995, "time_diff": 1.473135, "time_forward": 0.397665, "time_loss": 0.000285}
[03/27 21:21:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "200", "eta": "2:27:52", "loss": 0.213719, "lr": 0.000450, "mode": "train", "time_backward": 1.083601, "time_data": 0.017246, "time_diff": 1.505653, "time_forward": 0.396507, "time_loss": 0.000226}
[03/27 21:23:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "210", "eta": "2:27:38", "loss": 0.197307, "lr": 0.000466, "mode": "train", "time_backward": 1.054447, "time_data": 0.018487, "time_diff": 1.503529, "time_forward": 0.426916, "time_loss": 0.000478}
[03/27 21:23:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "220", "eta": "2:27:16", "loss": 0.212175, "lr": 0.000483, "mode": "train", "time_backward": 1.052618, "time_data": 0.017095, "time_diff": 1.474547, "time_forward": 0.397680, "time_loss": 0.000245}
[03/27 21:25:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "230", "eta": "2:26:57", "loss": 0.201842, "lr": 0.000499, "mode": "train", "time_backward": 1.052373, "time_data": 0.018848, "time_diff": 1.483158, "time_forward": 0.398974, "time_loss": 0.000271}
[03/27 21:25:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "240", "eta": "2:26:36", "loss": 0.206133, "lr": 0.000515, "mode": "train", "time_backward": 1.052207, "time_data": 0.017471, "time_diff": 1.476281, "time_forward": 0.398083, "time_loss": 0.000336}
[03/27 21:27:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "250", "eta": "2:26:16", "loss": 0.188669, "lr": 0.000532, "mode": "train", "time_backward": 1.055062, "time_data": 0.016841, "time_diff": 1.477323, "time_forward": 0.397756, "time_loss": 0.000340}
[03/27 21:28:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "260", "eta": "2:25:56", "loss": 0.192497, "lr": 0.000548, "mode": "train", "time_backward": 1.054851, "time_data": 0.016997, "time_diff": 1.475561, "time_forward": 0.398187, "time_loss": 0.000330}
[03/27 21:29:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "270", "eta": "2:25:37", "loss": 0.184402, "lr": 0.000564, "mode": "train", "time_backward": 1.056176, "time_data": 0.017046, "time_diff": 1.476683, "time_forward": 0.399696, "time_loss": 0.000344}
[03/27 21:30:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "280", "eta": "3:28:14", "loss": 0.173302, "lr": 0.000580, "mode": "train", "time_backward": 1.052850, "time_data": 18.145059, "time_diff": 19.608438, "time_forward": 0.406989, "time_loss": 0.000225}
[03/27 21:31:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "290", "eta": "3:25:38", "loss": 0.161519, "lr": 0.000597, "mode": "train", "time_backward": 1.054373, "time_data": 0.016896, "time_diff": 1.474579, "time_forward": 0.399754, "time_loss": 0.000372}
[03/27 21:32:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "300", "eta": "3:23:12", "loss": 0.186068, "lr": 0.000613, "mode": "train", "time_backward": 1.051568, "time_data": 0.017669, "time_diff": 1.474772, "time_forward": 0.398420, "time_loss": 0.000221}
[03/27 21:34:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "310", "eta": "3:20:55", "loss": 0.182363, "lr": 0.000629, "mode": "train", "time_backward": 1.053432, "time_data": 0.017172, "time_diff": 1.475118, "time_forward": 0.397580, "time_loss": 0.000346}
[03/27 21:34:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "320", "eta": "3:18:46", "loss": 0.179973, "lr": 0.000646, "mode": "train", "time_backward": 1.058279, "time_data": 0.018114, "time_diff": 1.482690, "time_forward": 0.397865, "time_loss": 0.000331}
[03/27 21:36:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "330", "eta": "3:46:25", "loss": 0.163829, "lr": 0.000662, "mode": "train", "time_backward": 11.221742, "time_data": 0.017220, "time_diff": 11.644044, "time_forward": 0.398268, "time_loss": 0.000266}
[03/27 21:37:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "340", "eta": "5:22:10", "loss": 0.151912, "lr": 0.000678, "mode": "train", "time_backward": 32.642229, "time_data": 0.017199, "time_diff": 36.338213, "time_forward": 0.397883, "time_loss": 3.275110}
[03/27 21:38:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "350", "eta": "5:16:34", "loss": 0.170898, "lr": 0.000695, "mode": "train", "time_backward": 1.090923, "time_data": 0.018068, "time_diff": 1.512890, "time_forward": 0.396538, "time_loss": 0.000312}
[03/27 21:39:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "360", "eta": "5:11:10", "loss": 0.170800, "lr": 0.000711, "mode": "train", "time_backward": 1.054977, "time_data": 0.016898, "time_diff": 1.474004, "time_forward": 0.398016, "time_loss": 0.000215}
[03/27 21:41:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "370", "eta": "5:56:14", "loss": 0.152782, "lr": 0.000727, "mode": "train", "time_backward": 20.458133, "time_data": 0.017832, "time_diff": 20.880523, "time_forward": 0.399410, "time_loss": 0.000330}
[03/27 21:42:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "380", "eta": "5:49:58", "loss": 0.177902, "lr": 0.000744, "mode": "train", "time_backward": 1.052644, "time_data": 0.017727, "time_diff": 1.480615, "time_forward": 0.398914, "time_loss": 0.000336}
[03/27 21:43:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "390", "eta": "5:44:00", "loss": 0.154440, "lr": 0.000760, "mode": "train", "time_backward": 1.050875, "time_data": 0.016793, "time_diff": 1.471908, "time_forward": 0.398450, "time_loss": 0.000298}
[03/27 21:43:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "400", "eta": "5:38:19", "loss": 0.153339, "lr": 0.000776, "mode": "train", "time_backward": 1.052223, "time_data": 0.016967, "time_diff": 1.471878, "time_forward": 0.397815, "time_loss": 0.000222}
[03/27 21:45:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "410", "eta": "6:16:55", "loss": 0.184015, "lr": 0.000793, "mode": "train", "time_backward": 20.042440, "time_data": 0.017011, "time_diff": 20.463268, "time_forward": 0.397134, "time_loss": 0.000226}
[03/27 21:46:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "420", "eta": "6:16:34", "loss": 0.164747, "lr": 0.000809, "mode": "train", "time_backward": 3.684728, "time_data": 0.016941, "time_diff": 4.106982, "time_forward": 0.398133, "time_loss": 0.000294}
[03/27 21:46:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "430", "eta": "6:10:24", "loss": 0.153369, "lr": 0.000825, "mode": "train", "time_backward": 1.053048, "time_data": 0.016936, "time_diff": 1.471080, "time_forward": 0.397685, "time_loss": 0.000252}
[03/27 21:47:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "440", "eta": "6:04:31", "loss": 0.175182, "lr": 0.000842, "mode": "train", "time_backward": 1.051546, "time_data": 0.017186, "time_diff": 1.474974, "time_forward": 0.395956, "time_loss": 0.000228}
[03/27 21:49:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "450", "eta": "6:16:00", "loss": 0.161510, "lr": 0.000858, "mode": "train", "time_backward": 9.222620, "time_data": 0.016961, "time_diff": 9.645537, "time_forward": 0.398755, "time_loss": 0.000272}
[03/27 21:50:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "460", "eta": "6:10:12", "loss": 0.164523, "lr": 0.000874, "mode": "train", "time_backward": 1.055349, "time_data": 0.017308, "time_diff": 1.474004, "time_forward": 0.397877, "time_loss": 0.000234}
[03/27 21:51:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "470", "eta": "6:04:57", "loss": 0.162560, "lr": 0.000891, "mode": "train", "time_backward": 1.058008, "time_data": 0.027508, "time_diff": 1.631439, "time_forward": 0.541804, "time_loss": 0.000988}
[03/27 21:51:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "480", "eta": "5:59:36", "loss": 0.168876, "lr": 0.000907, "mode": "train", "time_backward": 1.059024, "time_data": 0.017256, "time_diff": 1.478415, "time_forward": 0.398964, "time_loss": 0.000389}
[03/27 21:53:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "490", "eta": "7:09:54", "loss": 0.163798, "lr": 0.000923, "mode": "train", "time_backward": 40.379325, "time_data": 0.017222, "time_diff": 40.935149, "time_forward": 0.399498, "time_loss": 0.000286}
[03/27 21:53:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "500", "eta": "7:03:20", "loss": 0.153334, "lr": 0.000940, "mode": "train", "time_backward": 1.074642, "time_data": 0.016865, "time_diff": 1.490427, "time_forward": 0.397845, "time_loss": 0.000228}
[03/27 21:55:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "510", "eta": "6:57:02", "loss": 0.143370, "lr": 0.000956, "mode": "train", "time_backward": 1.066618, "time_data": 0.017426, "time_diff": 1.490429, "time_forward": 0.398445, "time_loss": 0.000214}
[03/27 21:55:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "520", "eta": "7:03:27", "loss": 0.135021, "lr": 0.000972, "mode": "train", "time_backward": 8.044840, "time_data": 0.017296, "time_diff": 8.468231, "time_forward": 0.398608, "time_loss": 0.000241}
[03/27 21:57:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "530", "eta": "6:57:23", "loss": 0.160374, "lr": 0.000989, "mode": "train", "time_backward": 1.051586, "time_data": 0.017840, "time_diff": 1.520633, "time_forward": 0.447569, "time_loss": 0.000275}
[03/27 21:58:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "540", "eta": "6:51:32", "loss": 0.147817, "lr": 0.001005, "mode": "train", "time_backward": 1.094881, "time_data": 0.017480, "time_diff": 1.518982, "time_forward": 0.398630, "time_loss": 0.000289}
[03/27 21:59:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "550", "eta": "6:45:49", "loss": 0.147927, "lr": 0.001021, "mode": "train", "time_backward": 1.055631, "time_data": 0.017294, "time_diff": 1.480618, "time_forward": 0.398994, "time_loss": 0.000433}
[03/27 22:00:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "560", "eta": "6:40:17", "loss": 0.170341, "lr": 0.001038, "mode": "train", "time_backward": 1.052248, "time_data": 0.016960, "time_diff": 1.470422, "time_forward": 0.397323, "time_loss": 0.000286}
[03/27 22:01:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "570", "eta": "6:51:22", "loss": 0.147686, "lr": 0.001054, "mode": "train", "time_backward": 11.112814, "time_data": 0.017344, "time_diff": 11.604960, "time_forward": 0.397897, "time_loss": 0.000247}
[03/27 22:01:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "580", "eta": "6:45:53", "loss": 0.149297, "lr": 0.001070, "mode": "train", "time_backward": 1.057682, "time_data": 0.016942, "time_diff": 1.477793, "time_forward": 0.399311, "time_loss": 0.000251}
[03/27 22:01:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "590", "eta": "6:40:41", "loss": 0.162976, "lr": 0.001087, "mode": "train", "time_backward": 1.118753, "time_data": 0.017491, "time_diff": 1.540719, "time_forward": 0.399568, "time_loss": 0.000289}
[03/27 22:01:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "600", "eta": "6:35:36", "loss": 0.149906, "lr": 0.001103, "mode": "train", "time_backward": 1.087569, "time_data": 0.017584, "time_diff": 1.506558, "time_forward": 0.398069, "time_loss": 0.000244}
[03/27 22:02:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "610", "eta": "6:49:28", "loss": 0.162621, "lr": 0.001119, "mode": "train", "time_backward": 13.571403, "time_data": 0.016959, "time_diff": 14.012484, "time_forward": 0.403943, "time_loss": 0.000318}
[03/27 22:02:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "620", "eta": "6:44:57", "loss": 0.134424, "lr": 0.001136, "mode": "train", "time_backward": 1.112007, "time_data": 0.016829, "time_diff": 1.902491, "time_forward": 0.398381, "time_loss": 0.000422}
[03/27 22:02:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "630", "eta": "6:40:00", "loss": 0.167997, "lr": 0.001152, "mode": "train", "time_backward": 1.088647, "time_data": 0.019607, "time_diff": 1.523146, "time_forward": 0.411121, "time_loss": 0.000471}
[03/27 22:03:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "640", "eta": "6:35:09", "loss": 0.157839, "lr": 0.001168, "mode": "train", "time_backward": 1.054861, "time_data": 0.017321, "time_diff": 1.493487, "time_forward": 0.417318, "time_loss": 0.000708}
[03/27 22:03:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "650", "eta": "6:34:34", "loss": 0.149706, "lr": 0.001185, "mode": "train", "time_backward": 3.996553, "time_data": 0.017501, "time_diff": 4.423802, "time_forward": 0.398211, "time_loss": 0.000266}
[03/27 22:03:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "660", "eta": "6:29:55", "loss": 0.156221, "lr": 0.001201, "mode": "train", "time_backward": 1.062213, "time_data": 0.017479, "time_diff": 1.485457, "time_forward": 0.399043, "time_loss": 0.000330}
[03/27 22:04:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "670", "eta": "6:25:27", "loss": 0.162571, "lr": 0.001217, "mode": "train", "time_backward": 1.088882, "time_data": 0.017867, "time_diff": 1.514754, "time_forward": 0.399266, "time_loss": 0.000250}
[03/27 22:04:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "680", "eta": "6:21:06", "loss": 0.150556, "lr": 0.001234, "mode": "train", "time_backward": 1.090519, "time_data": 0.017553, "time_diff": 1.518262, "time_forward": 0.399317, "time_loss": 0.001391}
[03/27 22:04:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "690", "eta": "6:16:55", "loss": 0.155205, "lr": 0.001250, "mode": "train", "time_backward": 1.089102, "time_data": 0.016829, "time_diff": 1.550306, "time_forward": 0.397935, "time_loss": 0.000256}
[03/27 22:05:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "700", "eta": "6:12:52", "loss": 0.133667, "lr": 0.001266, "mode": "train", "time_backward": 1.146452, "time_data": 0.016726, "time_diff": 1.565787, "time_forward": 0.401480, "time_loss": 0.000405}
[03/27 22:05:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "710", "eta": "6:08:50", "loss": 0.148641, "lr": 0.001282, "mode": "train", "time_backward": 1.072105, "time_data": 0.021942, "time_diff": 1.500258, "time_forward": 0.399089, "time_loss": 0.000349}
[03/27 22:05:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "720", "eta": "6:04:54", "loss": 0.151245, "lr": 0.001299, "mode": "train", "time_backward": 1.072454, "time_data": 0.018169, "time_diff": 1.499231, "time_forward": 0.399208, "time_loss": 0.000225}
[03/27 22:06:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "730", "eta": "6:01:07", "loss": 0.164559, "lr": 0.001315, "mode": "train", "time_backward": 1.067395, "time_data": 0.030523, "time_diff": 1.530304, "time_forward": 0.417085, "time_loss": 0.000255}
[03/27 22:06:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "740", "eta": "5:57:24", "loss": 0.131007, "lr": 0.001331, "mode": "train", "time_backward": 1.089282, "time_data": 0.017580, "time_diff": 1.512240, "time_forward": 0.398680, "time_loss": 0.000257}
[03/27 22:06:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "750", "eta": "5:53:45", "loss": 0.143092, "lr": 0.001348, "mode": "train", "time_backward": 1.055532, "time_data": 0.021166, "time_diff": 1.481681, "time_forward": 0.401151, "time_loss": 0.000472}
[03/27 22:07:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "760", "eta": "5:50:10", "loss": 0.158331, "lr": 0.001364, "mode": "train", "time_backward": 1.054286, "time_data": 0.017464, "time_diff": 1.475340, "time_forward": 0.400078, "time_loss": 0.000306}
[03/27 22:07:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "770", "eta": "5:54:22", "loss": 0.165025, "lr": 0.001380, "mode": "train", "time_backward": 7.694683, "time_data": 0.016736, "time_diff": 8.115639, "time_forward": 0.399095, "time_loss": 0.000443}
[03/27 22:07:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "780", "eta": "5:50:51", "loss": 0.150042, "lr": 0.001397, "mode": "train", "time_backward": 1.053310, "time_data": 0.016853, "time_diff": 1.479846, "time_forward": 0.398868, "time_loss": 0.000232}
[03/27 22:08:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "790", "eta": "5:48:30", "loss": 0.169183, "lr": 0.001413, "mode": "train", "time_backward": 2.018016, "time_data": 0.017331, "time_diff": 2.435857, "time_forward": 0.398204, "time_loss": 0.000316}
[03/27 22:08:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "800", "eta": "5:45:09", "loss": 0.138940, "lr": 0.001429, "mode": "train", "time_backward": 1.061252, "time_data": 0.024304, "time_diff": 1.498025, "time_forward": 0.408849, "time_loss": 0.000436}
[03/27 22:09:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "810", "eta": "5:41:55", "loss": 0.152841, "lr": 0.001446, "mode": "train", "time_backward": 1.110704, "time_data": 0.017277, "time_diff": 1.532832, "time_forward": 0.398896, "time_loss": 0.000249}
[03/27 22:09:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "820", "eta": "5:39:31", "loss": 0.141205, "lr": 0.001462, "mode": "train", "time_backward": 1.087103, "time_data": 0.540707, "time_diff": 2.236318, "time_forward": 0.601255, "time_loss": 0.000414}
[03/27 22:09:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "830", "eta": "5:36:21", "loss": 0.173255, "lr": 0.001478, "mode": "train", "time_backward": 1.054712, "time_data": 0.017665, "time_diff": 1.474602, "time_forward": 0.398567, "time_loss": 0.000242}
[03/27 22:10:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "840", "eta": "5:33:18", "loss": 0.165588, "lr": 0.001495, "mode": "train", "time_backward": 1.066730, "time_data": 0.020351, "time_diff": 1.504018, "time_forward": 0.399021, "time_loss": 0.000363}
[03/27 22:10:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "850", "eta": "5:30:17", "loss": 0.144301, "lr": 0.001511, "mode": "train", "time_backward": 1.058471, "time_data": 0.017115, "time_diff": 1.489575, "time_forward": 0.406846, "time_loss": 0.001316}
[03/27 22:11:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "860", "eta": "5:27:22", "loss": 0.148696, "lr": 0.001527, "mode": "train", "time_backward": 1.087010, "time_data": 0.017303, "time_diff": 1.515935, "time_forward": 0.407527, "time_loss": 0.002593}
[03/27 22:11:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "870", "eta": "5:24:36", "loss": 0.143210, "lr": 0.001544, "mode": "train", "time_backward": 1.054483, "time_data": 0.019298, "time_diff": 1.600283, "time_forward": 0.522480, "time_loss": 0.000318}
[03/27 22:11:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "880", "eta": "5:21:46", "loss": 0.150830, "lr": 0.001560, "mode": "train", "time_backward": 1.064379, "time_data": 0.017522, "time_diff": 1.483803, "time_forward": 0.398579, "time_loss": 0.000243}
[03/27 22:12:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "890", "eta": "5:26:00", "loss": 0.163581, "lr": 0.001576, "mode": "train", "time_backward": 8.234522, "time_data": 0.016979, "time_diff": 8.657140, "time_forward": 0.398605, "time_loss": 0.000315}
[03/27 22:12:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "900", "eta": "5:23:30", "loss": 0.141370, "lr": 0.001593, "mode": "train", "time_backward": 1.088871, "time_data": 0.024843, "time_diff": 1.801231, "time_forward": 0.681148, "time_loss": 0.000537}
[03/27 22:12:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "910", "eta": "5:20:45", "loss": 0.148991, "lr": 0.001609, "mode": "train", "time_backward": 1.059537, "time_data": 0.017220, "time_diff": 1.483265, "time_forward": 0.398668, "time_loss": 0.000237}
[03/27 22:13:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "920", "eta": "5:18:02", "loss": 0.167224, "lr": 0.001625, "mode": "train", "time_backward": 1.054914, "time_data": 0.016888, "time_diff": 1.477551, "time_forward": 0.399213, "time_loss": 0.000235}
[03/27 22:14:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "930", "eta": "5:15:26", "loss": 0.146436, "lr": 0.001642, "mode": "train", "time_backward": 1.101017, "time_data": 0.020978, "time_diff": 1.527091, "time_forward": 0.398200, "time_loss": 0.000288}
[03/27 22:14:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "940", "eta": "5:12:50", "loss": 0.133575, "lr": 0.001658, "mode": "train", "time_backward": 1.063051, "time_data": 0.017532, "time_diff": 1.484328, "time_forward": 0.401760, "time_loss": 0.000885}
[03/27 22:14:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "950", "eta": "5:14:51", "loss": 0.131791, "lr": 0.001674, "mode": "train", "time_backward": 1.071591, "time_data": 4.975110, "time_diff": 6.528105, "time_forward": 0.480613, "time_loss": 0.000411}
[03/27 22:15:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "960", "eta": "5:12:18", "loss": 0.152784, "lr": 0.001691, "mode": "train", "time_backward": 1.054455, "time_data": 0.018370, "time_diff": 1.480551, "time_forward": 0.399027, "time_loss": 0.000245}
[03/27 22:15:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "970", "eta": "5:09:47", "loss": 0.126014, "lr": 0.001707, "mode": "train", "time_backward": 1.056285, "time_data": 0.017058, "time_diff": 1.477154, "time_forward": 0.400080, "time_loss": 0.000359}
[03/27 22:15:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "980", "eta": "5:07:25", "loss": 0.142485, "lr": 0.001723, "mode": "train", "time_backward": 1.130045, "time_data": 0.041449, "time_diff": 1.597061, "time_forward": 0.419270, "time_loss": 0.000260}
[03/27 22:16:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "990", "eta": "5:05:00", "loss": 0.150184, "lr": 0.001740, "mode": "train", "time_backward": 1.054335, "time_data": 0.017155, "time_diff": 1.488947, "time_forward": 0.399239, "time_loss": 0.000257}
[03/27 22:16:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1000", "eta": "5:02:37", "loss": 0.147271, "lr": 0.001756, "mode": "train", "time_backward": 1.055825, "time_data": 0.018178, "time_diff": 1.475660, "time_forward": 0.398495, "time_loss": 0.000257}
[03/27 22:17:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1010", "eta": "5:00:17", "loss": 0.147732, "lr": 0.001772, "mode": "train", "time_backward": 1.053297, "time_data": 0.017975, "time_diff": 1.494024, "time_forward": 0.418758, "time_loss": 0.000641}
[03/27 22:17:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1020", "eta": "4:58:00", "loss": 0.145186, "lr": 0.001789, "mode": "train", "time_backward": 1.058790, "time_data": 0.019932, "time_diff": 1.484059, "time_forward": 0.398679, "time_loss": 0.000336}
[03/27 22:17:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1030", "eta": "4:55:45", "loss": 0.140976, "lr": 0.001805, "mode": "train", "time_backward": 1.065280, "time_data": 0.017044, "time_diff": 1.483817, "time_forward": 0.400254, "time_loss": 0.000232}
[03/27 22:17:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1040", "eta": "4:53:36", "loss": 0.139430, "lr": 0.001821, "mode": "train", "time_backward": 1.136734, "time_data": 0.017006, "time_diff": 1.560068, "time_forward": 0.399294, "time_loss": 0.000280}
[03/27 22:18:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1050", "eta": "4:51:25", "loss": 0.146927, "lr": 0.001838, "mode": "train", "time_backward": 1.054023, "time_data": 0.017837, "time_diff": 1.476508, "time_forward": 0.397607, "time_loss": 0.000315}
[03/27 22:18:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1060", "eta": "4:49:16", "loss": 0.143735, "lr": 0.001854, "mode": "train", "time_backward": 1.053472, "time_data": 0.016957, "time_diff": 1.473968, "time_forward": 0.400215, "time_loss": 0.000223}
[03/27 22:18:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1070", "eta": "4:47:13", "loss": 0.154007, "lr": 0.001870, "mode": "train", "time_backward": 1.121970, "time_data": 0.020095, "time_diff": 1.552930, "time_forward": 0.403020, "time_loss": 0.000282}
[03/27 22:19:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1080", "eta": "4:45:09", "loss": 0.159459, "lr": 0.001887, "mode": "train", "time_backward": 1.054598, "time_data": 0.021275, "time_diff": 1.490590, "time_forward": 0.408605, "time_loss": 0.000399}
[03/27 22:19:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1090", "eta": "4:43:08", "loss": 0.143370, "lr": 0.001903, "mode": "train", "time_backward": 1.103234, "time_data": 0.018008, "time_diff": 1.527588, "time_forward": 0.399347, "time_loss": 0.000318}
[03/27 22:20:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1100", "eta": "4:41:23", "loss": 0.129794, "lr": 0.001919, "mode": "train", "time_backward": 1.349145, "time_data": 0.017664, "time_diff": 1.815598, "time_forward": 0.398551, "time_loss": 0.000239}
[03/27 22:20:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1110", "eta": "4:39:24", "loss": 0.140588, "lr": 0.001936, "mode": "train", "time_backward": 1.056149, "time_data": 0.017409, "time_diff": 1.477907, "time_forward": 0.400803, "time_loss": 0.000408}
[03/27 22:21:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1120", "eta": "4:37:28", "loss": 0.127904, "lr": 0.001952, "mode": "train", "time_backward": 1.086282, "time_data": 0.018142, "time_diff": 1.507012, "time_forward": 0.399130, "time_loss": 0.000235}
[03/27 22:21:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1130", "eta": "4:35:36", "loss": 0.142553, "lr": 0.001968, "mode": "train", "time_backward": 1.077644, "time_data": 0.020713, "time_diff": 1.539958, "time_forward": 0.398903, "time_loss": 0.000263}
[03/27 22:21:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1140", "eta": "4:33:43", "loss": 0.139058, "lr": 0.001984, "mode": "train", "time_backward": 1.066161, "time_data": 0.021508, "time_diff": 1.499459, "time_forward": 0.403298, "time_loss": 0.000248}
[03/27 22:22:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1150", "eta": "4:31:53", "loss": 0.130474, "lr": 0.002001, "mode": "train", "time_backward": 1.054566, "time_data": 0.017513, "time_diff": 1.509046, "time_forward": 0.433300, "time_loss": 0.000314}
[03/27 22:22:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1160", "eta": "4:30:02", "loss": 0.155822, "lr": 0.002017, "mode": "train", "time_backward": 1.053787, "time_data": 0.016667, "time_diff": 1.474332, "time_forward": 0.400381, "time_loss": 0.000285}
[03/27 22:22:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1170", "eta": "4:28:16", "loss": 0.153975, "lr": 0.002033, "mode": "train", "time_backward": 1.094929, "time_data": 0.018366, "time_diff": 1.519440, "time_forward": 0.399145, "time_loss": 0.000226}
[03/27 22:23:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1180", "eta": "4:26:29", "loss": 0.157589, "lr": 0.002050, "mode": "train", "time_backward": 1.070971, "time_data": 0.017091, "time_diff": 1.490752, "time_forward": 0.399334, "time_loss": 0.000244}
[03/27 22:23:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1190", "eta": "4:24:44", "loss": 0.152881, "lr": 0.002066, "mode": "train", "time_backward": 1.053855, "time_data": 0.016980, "time_diff": 1.476261, "time_forward": 0.398876, "time_loss": 0.000235}
[03/27 22:24:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1200", "eta": "4:23:00", "loss": 0.134953, "lr": 0.002082, "mode": "train", "time_backward": 1.056945, "time_data": 0.017430, "time_diff": 1.477504, "time_forward": 0.399605, "time_loss": 0.000373}
[03/27 22:24:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1210", "eta": "4:21:18", "loss": 0.154997, "lr": 0.002099, "mode": "train", "time_backward": 1.052423, "time_data": 0.018998, "time_diff": 1.479214, "time_forward": 0.403582, "time_loss": 0.000251}
[03/27 22:24:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1220", "eta": "4:19:37", "loss": 0.142562, "lr": 0.002115, "mode": "train", "time_backward": 1.071854, "time_data": 0.017256, "time_diff": 1.493195, "time_forward": 0.400981, "time_loss": 0.000298}
[03/27 22:25:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1230", "eta": "4:17:59", "loss": 0.143735, "lr": 0.002131, "mode": "train", "time_backward": 1.063342, "time_data": 0.017937, "time_diff": 1.525792, "time_forward": 0.440863, "time_loss": 0.000450}
[03/27 22:25:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1240", "eta": "4:16:23", "loss": 0.135304, "lr": 0.002148, "mode": "train", "time_backward": 1.092490, "time_data": 0.020145, "time_diff": 1.520900, "time_forward": 0.400717, "time_loss": 0.000773}
[03/27 22:25:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1250", "eta": "4:14:47", "loss": 0.137106, "lr": 0.002164, "mode": "train", "time_backward": 1.072074, "time_data": 0.017752, "time_diff": 1.495160, "time_forward": 0.397929, "time_loss": 0.000228}
[03/27 22:26:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1260", "eta": "4:13:17", "loss": 0.124848, "lr": 0.002180, "mode": "train", "time_backward": 1.160608, "time_data": 0.075574, "time_diff": 1.644450, "time_forward": 0.404810, "time_loss": 0.000298}
[03/27 22:26:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1270", "eta": "4:11:43", "loss": 0.144915, "lr": 0.002197, "mode": "train", "time_backward": 1.051381, "time_data": 0.016786, "time_diff": 1.475906, "time_forward": 0.402292, "time_loss": 0.000362}
[03/27 22:27:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1280", "eta": "4:10:13", "loss": 0.139624, "lr": 0.002213, "mode": "train", "time_backward": 1.132212, "time_data": 0.018346, "time_diff": 1.565456, "time_forward": 0.406667, "time_loss": 0.000251}
[03/27 22:27:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1290", "eta": "4:09:00", "loss": 0.138912, "lr": 0.002229, "mode": "train", "time_backward": 1.103956, "time_data": 0.124435, "time_diff": 1.974136, "time_forward": 0.660788, "time_loss": 0.016998}
[03/27 22:27:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1300", "eta": "4:07:29", "loss": 0.152704, "lr": 0.002246, "mode": "train", "time_backward": 1.054262, "time_data": 0.016609, "time_diff": 1.476661, "time_forward": 0.396675, "time_loss": 0.000312}
[03/27 22:28:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1310", "eta": "4:06:00", "loss": 0.138115, "lr": 0.002262, "mode": "train", "time_backward": 1.060624, "time_data": 0.035972, "time_diff": 1.519364, "time_forward": 0.406444, "time_loss": 0.000266}
[03/27 22:28:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1320", "eta": "4:04:39", "loss": 0.162405, "lr": 0.002278, "mode": "train", "time_backward": 1.125318, "time_data": 0.017625, "time_diff": 1.674234, "time_forward": 0.527679, "time_loss": 0.000274}
[03/27 22:29:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1330", "eta": "4:03:21", "loss": 0.147378, "lr": 0.002295, "mode": "train", "time_backward": 1.321623, "time_data": 0.020148, "time_diff": 1.763486, "time_forward": 0.400558, "time_loss": 0.000374}
[03/27 22:29:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1340", "eta": "4:01:55", "loss": 0.139122, "lr": 0.002311, "mode": "train", "time_backward": 1.054987, "time_data": 0.018147, "time_diff": 1.481348, "time_forward": 0.402263, "time_loss": 0.000967}
[03/27 22:29:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1350", "eta": "4:00:31", "loss": 0.143714, "lr": 0.002327, "mode": "train", "time_backward": 1.053805, "time_data": 0.021368, "time_diff": 1.525817, "time_forward": 0.447013, "time_loss": 0.000406}
[03/27 22:29:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1360", "eta": "3:59:10", "loss": 0.146360, "lr": 0.002344, "mode": "train", "time_backward": 1.056441, "time_data": 0.037209, "time_diff": 1.577687, "time_forward": 0.478298, "time_loss": 0.000422}
[03/27 22:30:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1370", "eta": "3:57:46", "loss": 0.143030, "lr": 0.002360, "mode": "train", "time_backward": 1.054221, "time_data": 0.017741, "time_diff": 1.477258, "time_forward": 0.398967, "time_loss": 0.000367}
[03/27 22:30:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1380", "eta": "3:56:24", "loss": 0.149502, "lr": 0.002376, "mode": "train", "time_backward": 1.056672, "time_data": 0.017033, "time_diff": 1.475340, "time_forward": 0.398391, "time_loss": 0.000269}
[03/27 22:31:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1390", "eta": "3:55:02", "loss": 0.141344, "lr": 0.002393, "mode": "train", "time_backward": 1.056735, "time_data": 0.018826, "time_diff": 1.478938, "time_forward": 0.399790, "time_loss": 0.000717}
[03/27 22:31:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1400", "eta": "3:53:41", "loss": 0.155706, "lr": 0.002409, "mode": "train", "time_backward": 1.053596, "time_data": 0.017068, "time_diff": 1.475769, "time_forward": 0.398414, "time_loss": 0.000256}
[03/27 22:32:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1410", "eta": "3:52:22", "loss": 0.148994, "lr": 0.002425, "mode": "train", "time_backward": 1.054394, "time_data": 0.017892, "time_diff": 1.483651, "time_forward": 0.398395, "time_loss": 0.000251}
[03/27 22:32:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1420", "eta": "3:56:02", "loss": 0.141108, "lr": 0.002442, "mode": "train", "time_backward": 1.096980, "time_data": 9.019706, "time_diff": 10.540378, "time_forward": 0.418105, "time_loss": 0.000494}
[03/27 22:32:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1430", "eta": "3:54:44", "loss": 0.152388, "lr": 0.002458, "mode": "train", "time_backward": 1.125795, "time_data": 0.018477, "time_diff": 1.551215, "time_forward": 0.397988, "time_loss": 0.000348}
[03/27 22:33:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1440", "eta": "3:53:24", "loss": 0.164070, "lr": 0.002474, "mode": "train", "time_backward": 1.055050, "time_data": 0.017145, "time_diff": 1.476958, "time_forward": 0.398807, "time_loss": 0.000284}
[03/27 22:33:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1450", "eta": "3:52:08", "loss": 0.125588, "lr": 0.002491, "mode": "train", "time_backward": 1.122917, "time_data": 0.022206, "time_diff": 1.557999, "time_forward": 0.409094, "time_loss": 0.000494}
[03/27 22:33:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1460", "eta": "3:50:50", "loss": 0.126930, "lr": 0.002507, "mode": "train", "time_backward": 1.054145, "time_data": 0.017124, "time_diff": 1.474693, "time_forward": 0.398635, "time_loss": 0.000392}
[03/27 22:34:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1470", "eta": "3:49:33", "loss": 0.149064, "lr": 0.002523, "mode": "train", "time_backward": 1.060550, "time_data": 0.017044, "time_diff": 1.480239, "time_forward": 0.399306, "time_loss": 0.000272}
[03/27 22:34:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1480", "eta": "3:48:21", "loss": 0.149602, "lr": 0.002540, "mode": "train", "time_backward": 1.175358, "time_data": 0.019614, "time_diff": 1.608583, "time_forward": 0.397837, "time_loss": 0.000239}
[03/27 22:34:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1490", "eta": "3:47:05", "loss": 0.136483, "lr": 0.002556, "mode": "train", "time_backward": 1.054927, "time_data": 0.017485, "time_diff": 1.480768, "time_forward": 0.398512, "time_loss": 0.000201}
[03/27 22:35:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1500", "eta": "3:52:00", "loss": 0.143263, "lr": 0.002572, "mode": "train", "time_backward": 1.291864, "time_data": 9.861159, "time_diff": 13.493640, "time_forward": 2.171577, "time_loss": 0.107430}
[03/27 22:35:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1510", "eta": "3:50:43", "loss": 0.135325, "lr": 0.002589, "mode": "train", "time_backward": 1.053569, "time_data": 0.017051, "time_diff": 1.474679, "time_forward": 0.400651, "time_loss": 0.000299}
[03/27 22:36:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1520", "eta": "3:49:26", "loss": 0.147445, "lr": 0.002605, "mode": "train", "time_backward": 1.055435, "time_data": 0.017228, "time_diff": 1.480973, "time_forward": 0.399659, "time_loss": 0.000264}
[03/27 22:36:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1530", "eta": "3:48:12", "loss": 0.139491, "lr": 0.002621, "mode": "train", "time_backward": 1.089395, "time_data": 0.017653, "time_diff": 1.506659, "time_forward": 0.399087, "time_loss": 0.000246}
[03/27 22:37:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1540", "eta": "3:47:00", "loss": 0.159188, "lr": 0.002638, "mode": "train", "time_backward": 1.132385, "time_data": 0.020617, "time_diff": 1.567581, "time_forward": 0.398594, "time_loss": 0.000345}
[03/27 22:37:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1550", "eta": "3:45:46", "loss": 0.146725, "lr": 0.002654, "mode": "train", "time_backward": 1.057087, "time_data": 0.017533, "time_diff": 1.479938, "time_forward": 0.397943, "time_loss": 0.000276}
[03/27 22:38:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1560", "eta": "3:48:02", "loss": 0.128465, "lr": 0.002670, "mode": "train", "time_backward": 1.056139, "time_data": 7.197983, "time_diff": 8.657892, "time_forward": 0.400150, "time_loss": 0.000359}
[03/27 22:38:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1570", "eta": "3:46:48", "loss": 0.122341, "lr": 0.002687, "mode": "train", "time_backward": 1.058877, "time_data": 0.017101, "time_diff": 1.478163, "time_forward": 0.398552, "time_loss": 0.000399}
[03/27 22:39:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1580", "eta": "3:45:34", "loss": 0.150990, "lr": 0.002703, "mode": "train", "time_backward": 1.069132, "time_data": 0.016979, "time_diff": 1.488708, "time_forward": 0.399035, "time_loss": 0.000376}
[03/27 22:39:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1590", "eta": "3:44:22", "loss": 0.136801, "lr": 0.002719, "mode": "train", "time_backward": 1.070556, "time_data": 0.018171, "time_diff": 1.490588, "time_forward": 0.398400, "time_loss": 0.000244}
[03/27 22:39:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1600", "eta": "3:43:10", "loss": 0.134250, "lr": 0.002735, "mode": "train", "time_backward": 1.054042, "time_data": 0.016664, "time_diff": 1.473936, "time_forward": 0.398010, "time_loss": 0.000224}
[03/27 22:40:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1610", "eta": "3:41:59", "loss": 0.144622, "lr": 0.002752, "mode": "train", "time_backward": 1.078329, "time_data": 0.017929, "time_diff": 1.503641, "time_forward": 0.399770, "time_loss": 0.000351}
[03/27 22:40:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1620", "eta": "3:40:50", "loss": 0.140347, "lr": 0.002768, "mode": "train", "time_backward": 1.104737, "time_data": 0.016982, "time_diff": 1.528457, "time_forward": 0.398771, "time_loss": 0.000265}
[03/27 22:41:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1630", "eta": "3:39:42", "loss": 0.138380, "lr": 0.002784, "mode": "train", "time_backward": 1.070375, "time_data": 0.016928, "time_diff": 1.563649, "time_forward": 0.403709, "time_loss": 0.003934}
[03/27 22:41:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1640", "eta": "3:42:20", "loss": 0.143954, "lr": 0.002801, "mode": "train", "time_backward": 1.064315, "time_data": 8.242574, "time_diff": 9.791434, "time_forward": 0.427884, "time_loss": 0.000341}
[03/27 22:42:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1650", "eta": "3:41:10", "loss": 0.152687, "lr": 0.002817, "mode": "train", "time_backward": 1.053968, "time_data": 0.017301, "time_diff": 1.522055, "time_forward": 0.447317, "time_loss": 0.000228}
[03/27 22:42:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1660", "eta": "3:40:02", "loss": 0.135557, "lr": 0.002833, "mode": "train", "time_backward": 1.102106, "time_data": 0.017856, "time_diff": 1.528704, "time_forward": 0.398456, "time_loss": 0.000247}
[03/27 22:42:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1670", "eta": "3:38:53", "loss": 0.132410, "lr": 0.002850, "mode": "train", "time_backward": 1.070026, "time_data": 0.019296, "time_diff": 1.494923, "time_forward": 0.397951, "time_loss": 0.000267}
[03/27 22:43:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1680", "eta": "3:37:44", "loss": 0.149647, "lr": 0.002866, "mode": "train", "time_backward": 1.054618, "time_data": 0.016797, "time_diff": 1.474514, "time_forward": 0.398587, "time_loss": 0.000240}
[03/27 22:43:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1690", "eta": "3:36:39", "loss": 0.141663, "lr": 0.002882, "mode": "train", "time_backward": 1.134591, "time_data": 0.018090, "time_diff": 1.586759, "time_forward": 0.426827, "time_loss": 0.000342}
[03/27 22:44:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1700", "eta": "3:35:34", "loss": 0.130665, "lr": 0.002899, "mode": "train", "time_backward": 1.135230, "time_data": 0.018139, "time_diff": 1.559319, "time_forward": 0.399139, "time_loss": 0.000267}
[03/27 22:44:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1710", "eta": "3:34:28", "loss": 0.142660, "lr": 0.002915, "mode": "train", "time_backward": 1.062547, "time_data": 0.033089, "time_diff": 1.528310, "time_forward": 0.429263, "time_loss": 0.000976}
[03/27 22:44:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1720", "eta": "3:33:23", "loss": 0.156888, "lr": 0.002931, "mode": "train", "time_backward": 1.093578, "time_data": 0.016962, "time_diff": 1.516339, "time_forward": 0.399273, "time_loss": 0.000287}
[03/27 22:44:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1730", "eta": "3:32:18", "loss": 0.133208, "lr": 0.002948, "mode": "train", "time_backward": 1.072293, "time_data": 0.017187, "time_diff": 1.499698, "time_forward": 0.405799, "time_loss": 0.000385}
[03/27 22:45:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1740", "eta": "3:31:13", "loss": 0.153556, "lr": 0.002964, "mode": "train", "time_backward": 1.053071, "time_data": 0.016776, "time_diff": 1.472275, "time_forward": 0.398428, "time_loss": 0.000597}
[03/27 22:45:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1750", "eta": "3:30:11", "loss": 0.156887, "lr": 0.002980, "mode": "train", "time_backward": 1.159470, "time_data": 0.019141, "time_diff": 1.581477, "time_forward": 0.399269, "time_loss": 0.000420}
[03/27 22:45:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1760", "eta": "3:29:08", "loss": 0.146105, "lr": 0.002997, "mode": "train", "time_backward": 1.058155, "time_data": 0.017120, "time_diff": 1.482068, "time_forward": 0.398720, "time_loss": 0.000260}
[03/27 22:46:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1770", "eta": "3:28:04", "loss": 0.153017, "lr": 0.003013, "mode": "train", "time_backward": 1.054743, "time_data": 0.017251, "time_diff": 1.483234, "time_forward": 0.398315, "time_loss": 0.002526}
[03/27 22:46:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1780", "eta": "3:27:03", "loss": 0.130249, "lr": 0.003029, "mode": "train", "time_backward": 1.102861, "time_data": 0.017069, "time_diff": 1.524758, "time_forward": 0.402049, "time_loss": 0.000290}
[03/27 22:46:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1790", "eta": "3:26:22", "loss": 0.153904, "lr": 0.003046, "mode": "train", "time_backward": 1.963759, "time_data": 0.017458, "time_diff": 2.387643, "time_forward": 0.398414, "time_loss": 0.000244}
[03/27 22:47:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1800", "eta": "3:25:21", "loss": 0.139315, "lr": 0.003062, "mode": "train", "time_backward": 1.054361, "time_data": 0.022994, "time_diff": 1.478882, "time_forward": 0.398079, "time_loss": 0.000255}
[03/27 22:47:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1810", "eta": "3:24:19", "loss": 0.133083, "lr": 0.003078, "mode": "train", "time_backward": 1.053378, "time_data": 0.017110, "time_diff": 1.478189, "time_forward": 0.398909, "time_loss": 0.000357}
[03/27 22:47:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1820", "eta": "3:23:19", "loss": 0.153576, "lr": 0.003095, "mode": "train", "time_backward": 1.071701, "time_data": 0.017111, "time_diff": 1.512178, "time_forward": 0.401817, "time_loss": 0.000271}
[03/27 22:48:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1830", "eta": "3:22:20", "loss": 0.146088, "lr": 0.003111, "mode": "train", "time_backward": 1.053532, "time_data": 0.017267, "time_diff": 1.512790, "time_forward": 0.398206, "time_loss": 0.000243}
[03/27 22:48:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1840", "eta": "3:21:20", "loss": 0.147775, "lr": 0.003127, "mode": "train", "time_backward": 1.055850, "time_data": 0.018320, "time_diff": 1.479394, "time_forward": 0.398763, "time_loss": 0.000290}
[03/27 22:49:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1850", "eta": "3:20:21", "loss": 0.154942, "lr": 0.003144, "mode": "train", "time_backward": 1.062932, "time_data": 0.017451, "time_diff": 1.509726, "time_forward": 0.398619, "time_loss": 0.000259}
[03/27 22:49:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1860", "eta": "3:19:23", "loss": 0.120763, "lr": 0.003160, "mode": "train", "time_backward": 1.070600, "time_data": 0.020143, "time_diff": 1.494204, "time_forward": 0.397878, "time_loss": 0.000293}
[03/27 22:50:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1870", "eta": "3:18:24", "loss": 0.156296, "lr": 0.003176, "mode": "train", "time_backward": 1.055783, "time_data": 0.016929, "time_diff": 1.476612, "time_forward": 0.398540, "time_loss": 0.000238}
[03/27 22:50:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1880", "eta": "3:25:02", "loss": 0.135926, "lr": 0.003193, "mode": "train", "time_backward": 1.054101, "time_data": 20.266778, "time_diff": 21.732164, "time_forward": 0.407729, "time_loss": 0.000311}
[03/27 22:51:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1890", "eta": "3:24:01", "loss": 0.146311, "lr": 0.003209, "mode": "train", "time_backward": 1.059234, "time_data": 0.016990, "time_diff": 1.481059, "time_forward": 0.398241, "time_loss": 0.000347}
[03/27 22:51:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1900", "eta": "3:23:01", "loss": 0.116662, "lr": 0.003225, "mode": "train", "time_backward": 1.062806, "time_data": 0.016792, "time_diff": 1.482253, "time_forward": 0.398848, "time_loss": 0.000250}
[03/27 22:52:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1910", "eta": "3:22:02", "loss": 0.135179, "lr": 0.003242, "mode": "train", "time_backward": 1.108808, "time_data": 0.017540, "time_diff": 1.531824, "time_forward": 0.398885, "time_loss": 0.000397}
[03/27 22:52:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1920", "eta": "3:21:02", "loss": 0.117497, "lr": 0.003258, "mode": "train", "time_backward": 1.054402, "time_data": 0.017241, "time_diff": 1.475112, "time_forward": 0.399432, "time_loss": 0.000333}
[03/27 22:52:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1930", "eta": "3:20:03", "loss": 0.131024, "lr": 0.003274, "mode": "train", "time_backward": 1.053350, "time_data": 0.016801, "time_diff": 1.472783, "time_forward": 0.398224, "time_loss": 0.000259}
[03/27 22:53:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1940", "eta": "3:19:04", "loss": 0.157415, "lr": 0.003291, "mode": "train", "time_backward": 1.052274, "time_data": 0.020663, "time_diff": 1.476164, "time_forward": 0.398551, "time_loss": 0.000306}
[03/27 22:53:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1950", "eta": "3:18:07", "loss": 0.145291, "lr": 0.003307, "mode": "train", "time_backward": 1.096833, "time_data": 0.016983, "time_diff": 1.521068, "time_forward": 0.399126, "time_loss": 0.000393}
[03/27 22:54:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1960", "eta": "3:22:45", "loss": 0.117525, "lr": 0.003323, "mode": "train", "time_backward": 1.057276, "time_data": 15.437469, "time_diff": 17.353010, "time_forward": 0.843701, "time_loss": 0.000387}
[03/27 22:54:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1970", "eta": "3:21:45", "loss": 0.127661, "lr": 0.003340, "mode": "train", "time_backward": 1.051449, "time_data": 0.017500, "time_diff": 1.473649, "time_forward": 0.398595, "time_loss": 0.000256}
[03/27 22:55:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1980", "eta": "3:20:46", "loss": 0.144987, "lr": 0.003356, "mode": "train", "time_backward": 1.053476, "time_data": 0.017081, "time_diff": 1.482495, "time_forward": 0.397402, "time_loss": 0.000201}
[03/27 22:55:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "1990", "eta": "3:19:48", "loss": 0.128040, "lr": 0.003372, "mode": "train", "time_backward": 1.078672, "time_data": 0.017341, "time_diff": 1.500623, "time_forward": 0.397379, "time_loss": 0.000208}
[03/27 22:56:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2000", "eta": "3:18:51", "loss": 0.118503, "lr": 0.003389, "mode": "train", "time_backward": 1.067145, "time_data": 0.018384, "time_diff": 1.554814, "time_forward": 0.416290, "time_loss": 0.000360}
[03/27 22:57:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2010", "eta": "3:17:56", "loss": 0.130514, "lr": 0.003405, "mode": "train", "time_backward": 1.084543, "time_data": 0.016628, "time_diff": 1.660804, "time_forward": 0.501463, "time_loss": 0.000359}
[03/27 22:57:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2020", "eta": "3:17:00", "loss": 0.142261, "lr": 0.003421, "mode": "train", "time_backward": 1.093999, "time_data": 0.038059, "time_diff": 1.540323, "time_forward": 0.398975, "time_loss": 0.000813}
[03/27 22:57:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2030", "eta": "3:16:03", "loss": 0.142061, "lr": 0.003437, "mode": "train", "time_backward": 1.069495, "time_data": 0.016841, "time_diff": 1.491582, "time_forward": 0.398042, "time_loss": 0.000386}
[03/27 22:57:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2040", "eta": "3:15:06", "loss": 0.133337, "lr": 0.003454, "mode": "train", "time_backward": 1.073931, "time_data": 0.017071, "time_diff": 1.493930, "time_forward": 0.399508, "time_loss": 0.000371}
[03/27 22:58:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2050", "eta": "3:14:10", "loss": 0.138999, "lr": 0.003470, "mode": "train", "time_backward": 1.054603, "time_data": 0.017240, "time_diff": 1.477165, "time_forward": 0.399017, "time_loss": 0.000251}
[03/27 22:58:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2060", "eta": "3:13:14", "loss": 0.133348, "lr": 0.003486, "mode": "train", "time_backward": 1.072123, "time_data": 0.017521, "time_diff": 1.492669, "time_forward": 0.398307, "time_loss": 0.000269}
[03/27 22:58:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2070", "eta": "3:12:18", "loss": 0.136119, "lr": 0.003503, "mode": "train", "time_backward": 1.054900, "time_data": 0.017714, "time_diff": 1.476168, "time_forward": 0.398422, "time_loss": 0.000254}
[03/27 22:58:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2080", "eta": "3:11:26", "loss": 0.136182, "lr": 0.003519, "mode": "train", "time_backward": 1.159283, "time_data": 0.018988, "time_diff": 1.598277, "time_forward": 0.400060, "time_loss": 0.000270}
[03/27 22:59:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2090", "eta": "3:10:33", "loss": 0.133289, "lr": 0.003535, "mode": "train", "time_backward": 1.163532, "time_data": 0.022567, "time_diff": 1.608783, "time_forward": 0.401583, "time_loss": 0.000369}
[03/27 22:59:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2100", "eta": "3:09:40", "loss": 0.136651, "lr": 0.003552, "mode": "train", "time_backward": 1.083823, "time_data": 0.017619, "time_diff": 1.569890, "time_forward": 0.408689, "time_loss": 0.000263}
[03/27 22:59:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2110", "eta": "3:08:48", "loss": 0.142917, "lr": 0.003568, "mode": "train", "time_backward": 1.145214, "time_data": 0.019237, "time_diff": 1.580659, "time_forward": 0.397536, "time_loss": 0.000228}
[03/27 22:59:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2120", "eta": "3:07:55", "loss": 0.148515, "lr": 0.003584, "mode": "train", "time_backward": 1.069726, "time_data": 0.016881, "time_diff": 1.504978, "time_forward": 0.399266, "time_loss": 0.000254}
[03/27 23:00:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2130", "eta": "3:07:01", "loss": 0.128062, "lr": 0.003601, "mode": "train", "time_backward": 1.056952, "time_data": 0.017011, "time_diff": 1.476769, "time_forward": 0.398817, "time_loss": 0.000534}
[03/27 23:00:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2140", "eta": "3:06:10", "loss": 0.126317, "lr": 0.003617, "mode": "train", "time_backward": 1.079076, "time_data": 0.017939, "time_diff": 1.580278, "time_forward": 0.410398, "time_loss": 0.000887}
[03/27 23:00:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2150", "eta": "3:05:17", "loss": 0.136728, "lr": 0.003633, "mode": "train", "time_backward": 1.050142, "time_data": 0.016713, "time_diff": 1.471760, "time_forward": 0.397372, "time_loss": 0.000254}
[03/27 23:01:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2160", "eta": "3:04:25", "loss": 0.121947, "lr": 0.003650, "mode": "train", "time_backward": 1.075411, "time_data": 0.017200, "time_diff": 1.497553, "time_forward": 0.397743, "time_loss": 0.000236}
[03/27 23:01:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2170", "eta": "3:03:34", "loss": 0.136676, "lr": 0.003666, "mode": "train", "time_backward": 1.104180, "time_data": 0.016931, "time_diff": 1.522846, "time_forward": 0.398241, "time_loss": 0.000336}
[03/27 23:01:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2180", "eta": "3:03:20", "loss": 0.152131, "lr": 0.003682, "mode": "train", "time_backward": 1.803231, "time_data": 0.026943, "time_diff": 3.551698, "time_forward": 1.507258, "time_loss": 0.193294}
[03/27 23:01:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2190", "eta": "3:02:29", "loss": 0.139017, "lr": 0.003699, "mode": "train", "time_backward": 1.081051, "time_data": 0.021176, "time_diff": 1.509877, "time_forward": 0.400416, "time_loss": 0.006797}
[03/27 23:02:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2200", "eta": "3:01:38", "loss": 0.150878, "lr": 0.003715, "mode": "train", "time_backward": 1.080330, "time_data": 0.021379, "time_diff": 1.522915, "time_forward": 0.407511, "time_loss": 0.000706}
[03/27 23:02:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2210", "eta": "3:00:49", "loss": 0.130962, "lr": 0.003731, "mode": "train", "time_backward": 1.099755, "time_data": 0.046778, "time_diff": 1.553654, "time_forward": 0.401916, "time_loss": 0.000281}
[03/27 23:02:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2220", "eta": "2:59:58", "loss": 0.125831, "lr": 0.003748, "mode": "train", "time_backward": 1.057991, "time_data": 0.020169, "time_diff": 1.492963, "time_forward": 0.410435, "time_loss": 0.001133}
[03/27 23:02:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2230", "eta": "2:59:09", "loss": 0.124568, "lr": 0.003764, "mode": "train", "time_backward": 1.133832, "time_data": 0.017130, "time_diff": 1.556499, "time_forward": 0.397742, "time_loss": 0.000259}
[03/27 23:03:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2240", "eta": "2:58:20", "loss": 0.133387, "lr": 0.003780, "mode": "train", "time_backward": 1.068819, "time_data": 0.016984, "time_diff": 1.490301, "time_forward": 0.397880, "time_loss": 0.000246}
[03/27 23:03:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2250", "eta": "2:57:30", "loss": 0.137592, "lr": 0.003797, "mode": "train", "time_backward": 1.062335, "time_data": 0.017665, "time_diff": 1.481490, "time_forward": 0.399634, "time_loss": 0.000292}
[03/27 23:04:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2260", "eta": "2:56:41", "loss": 0.121670, "lr": 0.003813, "mode": "train", "time_backward": 1.081382, "time_data": 0.026742, "time_diff": 1.522494, "time_forward": 0.410848, "time_loss": 0.000433}
[03/27 23:04:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2270", "eta": "2:55:52", "loss": 0.118922, "lr": 0.003829, "mode": "train", "time_backward": 1.059579, "time_data": 0.017401, "time_diff": 1.482494, "time_forward": 0.397923, "time_loss": 0.000290}
[03/27 23:05:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2280", "eta": "2:55:05", "loss": 0.145658, "lr": 0.003846, "mode": "train", "time_backward": 1.160260, "time_data": 0.016761, "time_diff": 1.592306, "time_forward": 0.398353, "time_loss": 0.000258}
[03/27 23:05:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2290", "eta": "2:54:17", "loss": 0.121834, "lr": 0.003862, "mode": "train", "time_backward": 1.071407, "time_data": 0.017081, "time_diff": 1.495425, "time_forward": 0.398875, "time_loss": 0.000321}
[03/27 23:05:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2300", "eta": "2:53:32", "loss": 0.149506, "lr": 0.003878, "mode": "train", "time_backward": 1.086471, "time_data": 0.018969, "time_diff": 1.650284, "time_forward": 0.402464, "time_loss": 0.000256}
[03/27 23:05:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2310", "eta": "2:52:46", "loss": 0.145275, "lr": 0.003895, "mode": "train", "time_backward": 1.133609, "time_data": 0.017244, "time_diff": 1.620534, "time_forward": 0.398157, "time_loss": 0.000374}
[03/27 23:06:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2320", "eta": "2:52:00", "loss": 0.155428, "lr": 0.003911, "mode": "train", "time_backward": 1.088540, "time_data": 0.017510, "time_diff": 1.554656, "time_forward": 0.445022, "time_loss": 0.000421}
[03/27 23:06:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2330", "eta": "2:51:14", "loss": 0.133696, "lr": 0.003927, "mode": "train", "time_backward": 1.070550, "time_data": 0.024817, "time_diff": 1.561760, "time_forward": 0.441823, "time_loss": 0.000260}
[03/27 23:06:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2340", "eta": "2:50:27", "loss": 0.134719, "lr": 0.003944, "mode": "train", "time_backward": 1.084926, "time_data": 0.017260, "time_diff": 1.511239, "time_forward": 0.399840, "time_loss": 0.000324}
[03/27 23:06:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2350", "eta": "2:49:40", "loss": 0.133587, "lr": 0.003960, "mode": "train", "time_backward": 1.069510, "time_data": 0.017297, "time_diff": 1.498164, "time_forward": 0.402779, "time_loss": 0.000352}
[03/27 23:07:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2360", "eta": "2:48:54", "loss": 0.137073, "lr": 0.003976, "mode": "train", "time_backward": 1.091910, "time_data": 0.016682, "time_diff": 1.514172, "time_forward": 0.398554, "time_loss": 0.000244}
[03/27 23:07:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2370", "eta": "2:48:08", "loss": 0.128021, "lr": 0.003993, "mode": "train", "time_backward": 1.055295, "time_data": 0.017949, "time_diff": 1.475446, "time_forward": 0.398810, "time_loss": 0.000206}
[03/27 23:08:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2380", "eta": "2:47:22", "loss": 0.149981, "lr": 0.004009, "mode": "train", "time_backward": 1.065326, "time_data": 0.018604, "time_diff": 1.485630, "time_forward": 0.398605, "time_loss": 0.000283}
[03/27 23:08:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2390", "eta": "2:46:36", "loss": 0.143681, "lr": 0.004025, "mode": "train", "time_backward": 1.066679, "time_data": 0.016926, "time_diff": 1.489198, "time_forward": 0.398324, "time_loss": 0.000252}
[03/27 23:08:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2400", "eta": "2:45:51", "loss": 0.134327, "lr": 0.004042, "mode": "train", "time_backward": 1.056407, "time_data": 0.017984, "time_diff": 1.479524, "time_forward": 0.401563, "time_loss": 0.000389}
[03/27 23:09:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2410", "eta": "2:45:08", "loss": 0.124413, "lr": 0.004058, "mode": "train", "time_backward": 1.073168, "time_data": 0.017689, "time_diff": 1.648930, "time_forward": 0.530654, "time_loss": 0.000465}
[03/27 23:09:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2420", "eta": "2:44:23", "loss": 0.136511, "lr": 0.004074, "mode": "train", "time_backward": 1.053291, "time_data": 0.018793, "time_diff": 1.485617, "time_forward": 0.398573, "time_loss": 0.000271}
[03/27 23:09:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2430", "eta": "2:43:39", "loss": 0.137897, "lr": 0.004091, "mode": "train", "time_backward": 1.054711, "time_data": 0.017404, "time_diff": 1.517786, "time_forward": 0.399403, "time_loss": 0.000281}
[03/27 23:10:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2440", "eta": "2:42:55", "loss": 0.139312, "lr": 0.004107, "mode": "train", "time_backward": 1.054599, "time_data": 0.016886, "time_diff": 1.521068, "time_forward": 0.398587, "time_loss": 0.000261}
[03/27 23:11:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2450", "eta": "2:42:14", "loss": 0.142882, "lr": 0.004123, "mode": "train", "time_backward": 1.063901, "time_data": 0.017260, "time_diff": 1.690412, "time_forward": 0.422587, "time_loss": 0.000248}
[03/27 23:11:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2460", "eta": "2:41:31", "loss": 0.152692, "lr": 0.004139, "mode": "train", "time_backward": 1.084447, "time_data": 0.019018, "time_diff": 1.552396, "time_forward": 0.435771, "time_loss": 0.001112}
[03/27 23:11:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2470", "eta": "2:40:48", "loss": 0.136883, "lr": 0.004156, "mode": "train", "time_backward": 1.057105, "time_data": 0.039774, "time_diff": 1.558477, "time_forward": 0.399328, "time_loss": 0.000346}
[03/27 23:12:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2480", "eta": "2:40:06", "loss": 0.125673, "lr": 0.004172, "mode": "train", "time_backward": 1.053723, "time_data": 0.019137, "time_diff": 1.542020, "time_forward": 0.408267, "time_loss": 0.000403}
[03/27 23:12:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2490", "eta": "2:39:25", "loss": 0.150682, "lr": 0.004188, "mode": "train", "time_backward": 1.127893, "time_data": 0.018799, "time_diff": 1.674019, "time_forward": 0.398474, "time_loss": 0.000277}
[03/27 23:12:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2500", "eta": "2:38:42", "loss": 0.162422, "lr": 0.004205, "mode": "train", "time_backward": 1.055198, "time_data": 0.017184, "time_diff": 1.478557, "time_forward": 0.399588, "time_loss": 0.000345}
[03/27 23:12:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2510", "eta": "2:37:59", "loss": 0.117301, "lr": 0.004221, "mode": "train", "time_backward": 1.050414, "time_data": 0.017151, "time_diff": 1.473485, "time_forward": 0.398367, "time_loss": 0.000229}
[03/27 23:13:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2520", "eta": "2:37:17", "loss": 0.138451, "lr": 0.004237, "mode": "train", "time_backward": 1.125202, "time_data": 0.018349, "time_diff": 1.549260, "time_forward": 0.398519, "time_loss": 0.000268}
[03/27 23:13:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2530", "eta": "2:36:34", "loss": 0.141493, "lr": 0.004254, "mode": "train", "time_backward": 1.054530, "time_data": 0.017191, "time_diff": 1.476667, "time_forward": 0.398719, "time_loss": 0.000266}
[03/27 23:14:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2540", "eta": "2:35:52", "loss": 0.142643, "lr": 0.004270, "mode": "train", "time_backward": 1.054098, "time_data": 0.017680, "time_diff": 1.476290, "time_forward": 0.398430, "time_loss": 0.000243}
[03/27 23:14:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2550", "eta": "2:35:16", "loss": 0.124853, "lr": 0.004286, "mode": "train", "time_backward": 1.190397, "time_data": 0.017330, "time_diff": 1.927042, "time_forward": 0.715875, "time_loss": 0.000396}
[03/27 23:14:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2560", "eta": "2:34:34", "loss": 0.121114, "lr": 0.004303, "mode": "train", "time_backward": 1.056989, "time_data": 0.017000, "time_diff": 1.479039, "time_forward": 0.398469, "time_loss": 0.000280}
[03/27 23:15:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2570", "eta": "2:33:54", "loss": 0.131349, "lr": 0.004319, "mode": "train", "time_backward": 1.061567, "time_data": 0.025252, "time_diff": 1.573951, "time_forward": 0.483414, "time_loss": 0.000387}
[03/27 23:15:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2580", "eta": "2:33:14", "loss": 0.136651, "lr": 0.004335, "mode": "train", "time_backward": 1.159337, "time_data": 0.020530, "time_diff": 1.589564, "time_forward": 0.406118, "time_loss": 0.000270}
[03/27 23:15:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2590", "eta": "2:32:32", "loss": 0.153531, "lr": 0.004352, "mode": "train", "time_backward": 1.055202, "time_data": 0.017721, "time_diff": 1.480077, "time_forward": 0.403567, "time_loss": 0.000357}
[03/27 23:15:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2600", "eta": "2:31:52", "loss": 0.127755, "lr": 0.004368, "mode": "train", "time_backward": 1.063302, "time_data": 0.028175, "time_diff": 1.504415, "time_forward": 0.398334, "time_loss": 0.000423}
[03/27 23:16:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2610", "eta": "2:31:12", "loss": 0.111036, "lr": 0.004384, "mode": "train", "time_backward": 1.161145, "time_data": 0.016867, "time_diff": 1.582223, "time_forward": 0.397699, "time_loss": 0.000239}
[03/27 23:16:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2620", "eta": "2:30:33", "loss": 0.130330, "lr": 0.004401, "mode": "train", "time_backward": 1.060968, "time_data": 0.019568, "time_diff": 1.628259, "time_forward": 0.499268, "time_loss": 0.000255}
[03/27 23:17:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2630", "eta": "2:29:53", "loss": 0.129662, "lr": 0.004417, "mode": "train", "time_backward": 1.062885, "time_data": 0.017632, "time_diff": 1.485848, "time_forward": 0.399941, "time_loss": 0.000382}
[03/27 23:17:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2640", "eta": "2:29:13", "loss": 0.141044, "lr": 0.004433, "mode": "train", "time_backward": 1.053574, "time_data": 0.017105, "time_diff": 1.475310, "time_forward": 0.397254, "time_loss": 0.000279}
[03/27 23:17:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2650", "eta": "2:28:32", "loss": 0.143302, "lr": 0.004450, "mode": "train", "time_backward": 1.056282, "time_data": 0.017274, "time_diff": 1.479722, "time_forward": 0.399491, "time_loss": 0.000364}
[03/27 23:18:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2660", "eta": "2:27:53", "loss": 0.133259, "lr": 0.004466, "mode": "train", "time_backward": 1.069947, "time_data": 0.017067, "time_diff": 1.492629, "time_forward": 0.397853, "time_loss": 0.000251}
[03/27 23:18:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2670", "eta": "2:27:36", "loss": 0.125340, "lr": 0.004482, "mode": "train", "time_backward": 1.572953, "time_data": 0.029582, "time_diff": 3.242722, "time_forward": 1.557503, "time_loss": 0.074367}
[03/27 23:19:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2680", "eta": "2:26:56", "loss": 0.155224, "lr": 0.004499, "mode": "train", "time_backward": 1.087538, "time_data": 0.017807, "time_diff": 1.523232, "time_forward": 0.406478, "time_loss": 0.000298}
[03/27 23:19:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2690", "eta": "2:26:19", "loss": 0.118515, "lr": 0.004515, "mode": "train", "time_backward": 1.077065, "time_data": 0.016967, "time_diff": 1.678257, "time_forward": 0.398677, "time_loss": 0.000262}
[03/27 23:19:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2700", "eta": "2:25:40", "loss": 0.130386, "lr": 0.004531, "mode": "train", "time_backward": 1.055517, "time_data": 0.017154, "time_diff": 1.480072, "time_forward": 0.397915, "time_loss": 0.000248}
[03/27 23:20:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2710", "eta": "2:25:01", "loss": 0.140610, "lr": 0.004548, "mode": "train", "time_backward": 1.055585, "time_data": 0.017954, "time_diff": 1.542389, "time_forward": 0.465251, "time_loss": 0.000445}
[03/27 23:20:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2720", "eta": "2:24:22", "loss": 0.139118, "lr": 0.004564, "mode": "train", "time_backward": 1.056766, "time_data": 0.017981, "time_diff": 1.476662, "time_forward": 0.398445, "time_loss": 0.000253}
[03/27 23:20:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2730", "eta": "2:23:44", "loss": 0.124597, "lr": 0.004580, "mode": "train", "time_backward": 1.060666, "time_data": 0.021907, "time_diff": 1.503014, "time_forward": 0.407773, "time_loss": 0.000741}
[03/27 23:20:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2740", "eta": "2:23:06", "loss": 0.142598, "lr": 0.004597, "mode": "train", "time_backward": 1.093229, "time_data": 0.017098, "time_diff": 1.518017, "time_forward": 0.399637, "time_loss": 0.000303}
[03/27 23:21:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2750", "eta": "2:23:15", "loss": 0.122122, "lr": 0.004613, "mode": "train", "time_backward": 1.057193, "time_data": 3.910704, "time_diff": 5.409237, "time_forward": 0.437597, "time_loss": 0.000369}
[03/27 23:21:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2760", "eta": "2:22:38", "loss": 0.142807, "lr": 0.004629, "mode": "train", "time_backward": 1.122353, "time_data": 0.017616, "time_diff": 1.606769, "time_forward": 0.457415, "time_loss": 0.000384}
[03/27 23:21:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2770", "eta": "2:22:00", "loss": 0.126460, "lr": 0.004646, "mode": "train", "time_backward": 1.054884, "time_data": 0.017160, "time_diff": 1.480104, "time_forward": 0.399000, "time_loss": 0.000257}
[03/27 23:22:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2780", "eta": "2:21:22", "loss": 0.132166, "lr": 0.004662, "mode": "train", "time_backward": 1.071291, "time_data": 0.017524, "time_diff": 1.528324, "time_forward": 0.431760, "time_loss": 0.000387}
[03/27 23:22:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2790", "eta": "2:20:48", "loss": 0.143267, "lr": 0.004678, "mode": "train", "time_backward": 1.135456, "time_data": 0.018499, "time_diff": 1.824916, "time_forward": 0.666292, "time_loss": 0.000445}
[03/27 23:22:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2800", "eta": "2:20:10", "loss": 0.132437, "lr": 0.004695, "mode": "train", "time_backward": 1.056917, "time_data": 0.016989, "time_diff": 1.484479, "time_forward": 0.407053, "time_loss": 0.000320}
[03/27 23:23:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2810", "eta": "2:19:32", "loss": 0.136401, "lr": 0.004711, "mode": "train", "time_backward": 1.075690, "time_data": 0.018033, "time_diff": 1.517934, "time_forward": 0.413511, "time_loss": 0.000484}
[03/27 23:23:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2820", "eta": "2:18:56", "loss": 0.140885, "lr": 0.004727, "mode": "train", "time_backward": 1.062024, "time_data": 0.029521, "time_diff": 1.569554, "time_forward": 0.411016, "time_loss": 0.000362}
[03/27 23:23:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2830", "eta": "2:18:19", "loss": 0.129211, "lr": 0.004744, "mode": "train", "time_backward": 1.088475, "time_data": 0.016997, "time_diff": 1.519830, "time_forward": 0.398528, "time_loss": 0.000234}
[03/27 23:24:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2840", "eta": "2:17:42", "loss": 0.137363, "lr": 0.004760, "mode": "train", "time_backward": 1.063895, "time_data": 0.018334, "time_diff": 1.519756, "time_forward": 0.410125, "time_loss": 0.023202}
[03/27 23:24:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2850", "eta": "2:17:05", "loss": 0.141219, "lr": 0.004776, "mode": "train", "time_backward": 1.098124, "time_data": 0.019501, "time_diff": 1.560390, "time_forward": 0.409522, "time_loss": 0.000374}
[03/27 23:24:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2860", "eta": "2:16:28", "loss": 0.134479, "lr": 0.004793, "mode": "train", "time_backward": 1.061568, "time_data": 0.017792, "time_diff": 1.486884, "time_forward": 0.399803, "time_loss": 0.000356}
[03/27 23:24:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2870", "eta": "2:15:52", "loss": 0.129154, "lr": 0.004809, "mode": "train", "time_backward": 1.086291, "time_data": 0.016646, "time_diff": 1.509387, "time_forward": 0.398752, "time_loss": 0.000265}
[03/27 23:25:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2880", "eta": "2:15:17", "loss": 0.124294, "lr": 0.004825, "mode": "train", "time_backward": 1.164222, "time_data": 0.020367, "time_diff": 1.610556, "time_forward": 0.410533, "time_loss": 0.000298}
[03/27 23:25:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2890", "eta": "2:14:51", "loss": 0.141952, "lr": 0.004841, "mode": "train", "time_backward": 1.626417, "time_data": 0.017595, "time_diff": 2.498363, "time_forward": 0.449896, "time_loss": 0.000331}
[03/27 23:25:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2900", "eta": "2:14:15", "loss": 0.127002, "lr": 0.004858, "mode": "train", "time_backward": 1.055097, "time_data": 0.017752, "time_diff": 1.476285, "time_forward": 0.399589, "time_loss": 0.000329}
[03/27 23:26:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2910", "eta": "2:13:39", "loss": 0.128897, "lr": 0.004874, "mode": "train", "time_backward": 1.055134, "time_data": 0.017552, "time_diff": 1.512075, "time_forward": 0.398658, "time_loss": 0.000249}
[03/27 23:26:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2920", "eta": "2:13:03", "loss": 0.133771, "lr": 0.004890, "mode": "train", "time_backward": 1.128278, "time_data": 0.017522, "time_diff": 1.576559, "time_forward": 0.426630, "time_loss": 0.000286}
[03/27 23:26:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2930", "eta": "2:12:30", "loss": 0.136891, "lr": 0.004907, "mode": "train", "time_backward": 1.203615, "time_data": 0.021662, "time_diff": 1.710351, "time_forward": 0.404502, "time_loss": 0.000261}
[03/27 23:26:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2940", "eta": "2:11:54", "loss": 0.139874, "lr": 0.004923, "mode": "train", "time_backward": 1.077221, "time_data": 0.017437, "time_diff": 1.506363, "time_forward": 0.400045, "time_loss": 0.000615}
[03/27 23:27:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2950", "eta": "2:11:19", "loss": 0.113291, "lr": 0.004939, "mode": "train", "time_backward": 1.073655, "time_data": 0.016626, "time_diff": 1.527692, "time_forward": 0.416131, "time_loss": 0.000241}
[03/27 23:27:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2960", "eta": "2:10:44", "loss": 0.145911, "lr": 0.004956, "mode": "train", "time_backward": 1.139462, "time_data": 0.017403, "time_diff": 1.594765, "time_forward": 0.398954, "time_loss": 0.000262}
[03/27 23:27:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2970", "eta": "2:10:11", "loss": 0.128478, "lr": 0.004972, "mode": "train", "time_backward": 1.104294, "time_data": 0.032111, "time_diff": 1.741504, "time_forward": 0.565410, "time_loss": 0.000352}
[03/27 23:28:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2980", "eta": "2:09:36", "loss": 0.147742, "lr": 0.004988, "mode": "train", "time_backward": 1.062657, "time_data": 0.018987, "time_diff": 1.510335, "time_forward": 0.406531, "time_loss": 0.000687}
[03/27 23:28:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "2990", "eta": "2:09:03", "loss": 0.152391, "lr": 0.005005, "mode": "train", "time_backward": 1.180226, "time_data": 0.021479, "time_diff": 1.657950, "time_forward": 0.438660, "time_loss": 0.000267}
[03/27 23:28:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3000", "eta": "2:08:29", "loss": 0.133527, "lr": 0.005021, "mode": "train", "time_backward": 1.123879, "time_data": 0.021077, "time_diff": 1.602542, "time_forward": 0.401372, "time_loss": 0.000277}
[03/27 23:28:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3010", "eta": "2:07:55", "loss": 0.147156, "lr": 0.005037, "mode": "train", "time_backward": 1.115201, "time_data": 0.019469, "time_diff": 1.543217, "time_forward": 0.398589, "time_loss": 0.000266}
[03/27 23:29:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3020", "eta": "2:07:20", "loss": 0.115897, "lr": 0.005054, "mode": "train", "time_backward": 1.072369, "time_data": 0.017924, "time_diff": 1.505628, "time_forward": 0.410192, "time_loss": 0.000364}
[03/27 23:29:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3030", "eta": "2:06:46", "loss": 0.137190, "lr": 0.005070, "mode": "train", "time_backward": 1.054897, "time_data": 0.016722, "time_diff": 1.550494, "time_forward": 0.475236, "time_loss": 0.000242}
[03/27 23:29:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3040", "eta": "2:06:11", "loss": 0.134732, "lr": 0.005086, "mode": "train", "time_backward": 1.062010, "time_data": 0.021457, "time_diff": 1.487041, "time_forward": 0.400875, "time_loss": 0.000251}
[03/27 23:29:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3050", "eta": "2:05:37", "loss": 0.119003, "lr": 0.005103, "mode": "train", "time_backward": 1.072340, "time_data": 0.019315, "time_diff": 1.565706, "time_forward": 0.415033, "time_loss": 0.000457}
[03/27 23:30:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3060", "eta": "2:05:04", "loss": 0.114614, "lr": 0.005119, "mode": "train", "time_backward": 1.099554, "time_data": 0.021183, "time_diff": 1.530827, "time_forward": 0.407371, "time_loss": 0.002432}
[03/27 23:30:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3070", "eta": "2:04:30", "loss": 0.123713, "lr": 0.005135, "mode": "train", "time_backward": 1.059455, "time_data": 0.020116, "time_diff": 1.544490, "time_forward": 0.461359, "time_loss": 0.000249}
[03/27 23:30:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3080", "eta": "2:03:56", "loss": 0.128908, "lr": 0.005152, "mode": "train", "time_backward": 1.056634, "time_data": 0.027477, "time_diff": 1.539547, "time_forward": 0.451814, "time_loss": 0.000250}
[03/27 23:30:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3090", "eta": "2:03:23", "loss": 0.140819, "lr": 0.005168, "mode": "train", "time_backward": 1.055463, "time_data": 0.016509, "time_diff": 1.538784, "time_forward": 0.397439, "time_loss": 0.000271}
[03/27 23:31:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3100", "eta": "2:02:49", "loss": 0.148170, "lr": 0.005184, "mode": "train", "time_backward": 1.054796, "time_data": 0.017684, "time_diff": 1.477322, "time_forward": 0.399428, "time_loss": 0.000435}
[03/27 23:31:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3110", "eta": "2:02:15", "loss": 0.144016, "lr": 0.005201, "mode": "train", "time_backward": 1.062475, "time_data": 0.017143, "time_diff": 1.509845, "time_forward": 0.425438, "time_loss": 0.000369}
[03/27 23:31:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3120", "eta": "2:01:42", "loss": 0.126997, "lr": 0.005217, "mode": "train", "time_backward": 1.064303, "time_data": 0.023719, "time_diff": 1.528083, "time_forward": 0.402820, "time_loss": 0.000381}
[03/27 23:32:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3130", "eta": "2:01:09", "loss": 0.122968, "lr": 0.005233, "mode": "train", "time_backward": 1.073036, "time_data": 0.021278, "time_diff": 1.553616, "time_forward": 0.454675, "time_loss": 0.000326}
[03/27 23:32:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3140", "eta": "2:00:37", "loss": 0.159450, "lr": 0.005250, "mode": "train", "time_backward": 1.121238, "time_data": 0.018198, "time_diff": 1.625063, "time_forward": 0.481915, "time_loss": 0.000339}
[03/27 23:32:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3150", "eta": "2:00:04", "loss": 0.143888, "lr": 0.005266, "mode": "train", "time_backward": 1.108237, "time_data": 0.021128, "time_diff": 1.550082, "time_forward": 0.398469, "time_loss": 0.000235}
[03/27 23:32:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3160", "eta": "1:59:32", "loss": 0.115143, "lr": 0.005282, "mode": "train", "time_backward": 1.053616, "time_data": 0.046265, "time_diff": 1.531739, "time_forward": 0.418368, "time_loss": 0.000347}
[03/27 23:33:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3170", "eta": "1:58:59", "loss": 0.148942, "lr": 0.005299, "mode": "train", "time_backward": 1.097473, "time_data": 0.017085, "time_diff": 1.518509, "time_forward": 0.398108, "time_loss": 0.000264}
[03/27 23:33:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3180", "eta": "1:58:26", "loss": 0.133821, "lr": 0.005315, "mode": "train", "time_backward": 1.078193, "time_data": 0.018875, "time_diff": 1.502524, "time_forward": 0.398607, "time_loss": 0.000303}
[03/27 23:33:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3190", "eta": "1:57:53", "loss": 0.121849, "lr": 0.005331, "mode": "train", "time_backward": 1.057800, "time_data": 0.017794, "time_diff": 1.479049, "time_forward": 0.399977, "time_loss": 0.000282}
[03/27 23:33:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3200", "eta": "1:57:21", "loss": 0.117748, "lr": 0.005348, "mode": "train", "time_backward": 1.083693, "time_data": 0.017435, "time_diff": 1.564807, "time_forward": 0.463121, "time_loss": 0.000260}
[03/27 23:34:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3210", "eta": "1:56:49", "loss": 0.135855, "lr": 0.005364, "mode": "train", "time_backward": 1.054204, "time_data": 0.019847, "time_diff": 1.580645, "time_forward": 0.503141, "time_loss": 0.000276}
[03/27 23:34:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3220", "eta": "1:56:17", "loss": 0.150536, "lr": 0.005380, "mode": "train", "time_backward": 1.054973, "time_data": 0.033708, "time_diff": 1.544429, "time_forward": 0.452060, "time_loss": 0.000393}
[03/27 23:34:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3230", "eta": "1:55:45", "loss": 0.134762, "lr": 0.005397, "mode": "train", "time_backward": 1.054238, "time_data": 0.045883, "time_diff": 1.519900, "time_forward": 0.416188, "time_loss": 0.000263}
[03/27 23:34:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3240", "eta": "1:55:13", "loss": 0.142440, "lr": 0.005413, "mode": "train", "time_backward": 1.052840, "time_data": 0.020454, "time_diff": 1.491035, "time_forward": 0.397949, "time_loss": 0.000290}
[03/27 23:35:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3250", "eta": "1:54:42", "loss": 0.136161, "lr": 0.005429, "mode": "train", "time_backward": 1.092604, "time_data": 0.018099, "time_diff": 1.584420, "time_forward": 0.465787, "time_loss": 0.000288}
[03/27 23:35:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3260", "eta": "1:54:10", "loss": 0.140647, "lr": 0.005446, "mode": "train", "time_backward": 1.128748, "time_data": 0.016800, "time_diff": 1.555050, "time_forward": 0.398251, "time_loss": 0.000298}
[03/27 23:35:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3270", "eta": "1:53:39", "loss": 0.122466, "lr": 0.005462, "mode": "train", "time_backward": 1.090313, "time_data": 0.018585, "time_diff": 1.516831, "time_forward": 0.402289, "time_loss": 0.000438}
[03/27 23:36:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3280", "eta": "1:53:07", "loss": 0.126974, "lr": 0.005478, "mode": "train", "time_backward": 1.057754, "time_data": 0.016894, "time_diff": 1.478568, "time_forward": 0.398870, "time_loss": 0.000279}
[03/27 23:36:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3290", "eta": "1:52:35", "loss": 0.138571, "lr": 0.005495, "mode": "train", "time_backward": 1.060263, "time_data": 0.062639, "time_diff": 1.544005, "time_forward": 0.399402, "time_loss": 0.000415}
[03/27 23:36:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3300", "eta": "1:52:04", "loss": 0.144728, "lr": 0.005511, "mode": "train", "time_backward": 1.122988, "time_data": 0.019886, "time_diff": 1.549913, "time_forward": 0.402995, "time_loss": 0.000220}
[03/27 23:36:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3310", "eta": "1:51:33", "loss": 0.117859, "lr": 0.005527, "mode": "train", "time_backward": 1.136432, "time_data": 0.017020, "time_diff": 1.584001, "time_forward": 0.398807, "time_loss": 0.000327}
[03/27 23:37:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3320", "eta": "1:51:02", "loss": 0.115517, "lr": 0.005543, "mode": "train", "time_backward": 1.066233, "time_data": 0.020168, "time_diff": 1.512117, "time_forward": 0.415550, "time_loss": 0.000656}
[03/27 23:37:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3330", "eta": "1:50:31", "loss": 0.130796, "lr": 0.005560, "mode": "train", "time_backward": 1.065084, "time_data": 0.022561, "time_diff": 1.501276, "time_forward": 0.399045, "time_loss": 0.000360}
[03/27 23:37:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3340", "eta": "1:50:00", "loss": 0.124263, "lr": 0.005576, "mode": "train", "time_backward": 1.059040, "time_data": 0.020718, "time_diff": 1.524870, "time_forward": 0.435577, "time_loss": 0.000304}
[03/27 23:37:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3350", "eta": "1:49:29", "loss": 0.136631, "lr": 0.005592, "mode": "train", "time_backward": 1.049554, "time_data": 0.017751, "time_diff": 1.473932, "time_forward": 0.399291, "time_loss": 0.000249}
[03/27 23:38:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3360", "eta": "1:48:57", "loss": 0.108718, "lr": 0.005609, "mode": "train", "time_backward": 1.053189, "time_data": 0.016835, "time_diff": 1.473144, "time_forward": 0.399594, "time_loss": 0.000227}
[03/27 23:38:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3370", "eta": "1:48:27", "loss": 0.131418, "lr": 0.005625, "mode": "train", "time_backward": 1.063098, "time_data": 0.017024, "time_diff": 1.512186, "time_forward": 0.424912, "time_loss": 0.000359}
[03/27 23:39:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3380", "eta": "1:47:56", "loss": 0.128914, "lr": 0.005641, "mode": "train", "time_backward": 1.110792, "time_data": 0.020882, "time_diff": 1.572782, "time_forward": 0.399852, "time_loss": 0.000255}
[03/27 23:39:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3390", "eta": "1:47:26", "loss": 0.128424, "lr": 0.005658, "mode": "train", "time_backward": 1.078247, "time_data": 0.018764, "time_diff": 1.504388, "time_forward": 0.399770, "time_loss": 0.000396}
[03/27 23:39:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3400", "eta": "1:46:56", "loss": 0.143892, "lr": 0.005674, "mode": "train", "time_backward": 1.082530, "time_data": 0.017409, "time_diff": 1.641101, "time_forward": 0.537496, "time_loss": 0.000802}
[03/27 23:40:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3410", "eta": "1:46:26", "loss": 0.134744, "lr": 0.005690, "mode": "train", "time_backward": 1.139996, "time_data": 0.016769, "time_diff": 1.604362, "time_forward": 0.400763, "time_loss": 0.000246}
[03/27 23:40:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3420", "eta": "1:45:57", "loss": 0.110704, "lr": 0.005707, "mode": "train", "time_backward": 1.067740, "time_data": 0.017272, "time_diff": 1.580715, "time_forward": 0.492149, "time_loss": 0.000431}
[03/27 23:40:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3430", "eta": "1:45:26", "loss": 0.113775, "lr": 0.005723, "mode": "train", "time_backward": 1.077615, "time_data": 0.016642, "time_diff": 1.504282, "time_forward": 0.400801, "time_loss": 0.000393}
[03/27 23:40:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3440", "eta": "1:44:56", "loss": 0.141810, "lr": 0.005739, "mode": "train", "time_backward": 1.053909, "time_data": 0.032871, "time_diff": 1.551898, "time_forward": 0.461439, "time_loss": 0.000295}
[03/27 23:41:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3450", "eta": "1:44:27", "loss": 0.137456, "lr": 0.005756, "mode": "train", "time_backward": 1.095746, "time_data": 0.022847, "time_diff": 1.539082, "time_forward": 0.398953, "time_loss": 0.015693}
[03/27 23:41:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3460", "eta": "1:43:57", "loss": 0.127043, "lr": 0.005772, "mode": "train", "time_backward": 1.104022, "time_data": 0.068876, "time_diff": 1.609363, "time_forward": 0.412450, "time_loss": 0.000285}
[03/27 23:41:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3470", "eta": "1:43:27", "loss": 0.146606, "lr": 0.005788, "mode": "train", "time_backward": 1.052299, "time_data": 0.017396, "time_diff": 1.474638, "time_forward": 0.397792, "time_loss": 0.000225}
[03/27 23:41:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3480", "eta": "1:42:58", "loss": 0.126110, "lr": 0.005805, "mode": "train", "time_backward": 1.053116, "time_data": 0.023860, "time_diff": 1.574244, "time_forward": 0.483154, "time_loss": 0.000957}
[03/27 23:42:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3490", "eta": "1:42:28", "loss": 0.116696, "lr": 0.005821, "mode": "train", "time_backward": 1.100812, "time_data": 0.017474, "time_diff": 1.541027, "time_forward": 0.410685, "time_loss": 0.000299}
[03/27 23:42:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3500", "eta": "1:41:58", "loss": 0.123691, "lr": 0.005837, "mode": "train", "time_backward": 1.068175, "time_data": 0.017553, "time_diff": 1.493950, "time_forward": 0.397764, "time_loss": 0.000305}
[03/27 23:42:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3510", "eta": "1:41:28", "loss": 0.121467, "lr": 0.005854, "mode": "train", "time_backward": 1.051336, "time_data": 0.017676, "time_diff": 1.477678, "time_forward": 0.403275, "time_loss": 0.000409}
[03/27 23:43:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3520", "eta": "1:40:59", "loss": 0.120135, "lr": 0.005870, "mode": "train", "time_backward": 1.087014, "time_data": 0.019847, "time_diff": 1.514292, "time_forward": 0.399534, "time_loss": 0.000347}
[03/27 23:43:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3530", "eta": "1:40:30", "loss": 0.136691, "lr": 0.005886, "mode": "train", "time_backward": 1.062624, "time_data": 0.018815, "time_diff": 1.527882, "time_forward": 0.425265, "time_loss": 0.000502}
[03/27 23:43:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3540", "eta": "1:40:01", "loss": 0.114123, "lr": 0.005903, "mode": "train", "time_backward": 1.069068, "time_data": 0.017746, "time_diff": 1.626591, "time_forward": 0.499111, "time_loss": 0.000252}
[03/27 23:43:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3550", "eta": "1:39:32", "loss": 0.132588, "lr": 0.005919, "mode": "train", "time_backward": 1.063195, "time_data": 0.016829, "time_diff": 1.484231, "time_forward": 0.398044, "time_loss": 0.000296}
[03/27 23:44:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3560", "eta": "1:39:02", "loss": 0.145656, "lr": 0.005935, "mode": "train", "time_backward": 1.055361, "time_data": 0.017214, "time_diff": 1.504910, "time_forward": 0.398390, "time_loss": 0.000462}
[03/27 23:45:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3570", "eta": "1:38:33", "loss": 0.132544, "lr": 0.005952, "mode": "train", "time_backward": 1.073562, "time_data": 0.019418, "time_diff": 1.515829, "time_forward": 0.399722, "time_loss": 0.000238}
[03/27 23:45:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3580", "eta": "1:38:04", "loss": 0.130171, "lr": 0.005968, "mode": "train", "time_backward": 1.084287, "time_data": 0.018507, "time_diff": 1.512662, "time_forward": 0.406007, "time_loss": 0.000602}
[03/27 23:45:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3590", "eta": "1:37:35", "loss": 0.132218, "lr": 0.005984, "mode": "train", "time_backward": 1.070138, "time_data": 0.017472, "time_diff": 1.487943, "time_forward": 0.399605, "time_loss": 0.000297}
[03/27 23:45:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3600", "eta": "1:37:07", "loss": 0.127269, "lr": 0.006001, "mode": "train", "time_backward": 1.113626, "time_data": 0.018464, "time_diff": 1.555229, "time_forward": 0.400522, "time_loss": 0.000262}
[03/27 23:46:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3610", "eta": "1:36:39", "loss": 0.121331, "lr": 0.006017, "mode": "train", "time_backward": 1.122843, "time_data": 0.026510, "time_diff": 1.621828, "time_forward": 0.429011, "time_loss": 0.041044}
[03/27 23:46:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3620", "eta": "1:36:11", "loss": 0.131837, "lr": 0.006033, "mode": "train", "time_backward": 1.149289, "time_data": 0.021452, "time_diff": 1.619263, "time_forward": 0.419126, "time_loss": 0.000402}
[03/27 23:46:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3630", "eta": "1:35:42", "loss": 0.129987, "lr": 0.006050, "mode": "train", "time_backward": 1.106444, "time_data": 0.017724, "time_diff": 1.561088, "time_forward": 0.398683, "time_loss": 0.000304}
[03/27 23:47:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3640", "eta": "1:35:14", "loss": 0.126117, "lr": 0.006066, "mode": "train", "time_backward": 1.057869, "time_data": 0.020928, "time_diff": 1.543994, "time_forward": 0.464044, "time_loss": 0.000277}
[03/27 23:47:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3650", "eta": "1:34:46", "loss": 0.130328, "lr": 0.006082, "mode": "train", "time_backward": 1.120917, "time_data": 0.017877, "time_diff": 1.546150, "time_forward": 0.403961, "time_loss": 0.000271}
[03/27 23:47:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3660", "eta": "1:34:17", "loss": 0.134852, "lr": 0.006099, "mode": "train", "time_backward": 1.056081, "time_data": 0.016945, "time_diff": 1.533671, "time_forward": 0.404844, "time_loss": 0.000342}
[03/27 23:47:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3670", "eta": "1:33:49", "loss": 0.123057, "lr": 0.006115, "mode": "train", "time_backward": 1.077298, "time_data": 0.017157, "time_diff": 1.513849, "time_forward": 0.398988, "time_loss": 0.000277}
[03/27 23:48:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3680", "eta": "1:33:20", "loss": 0.119570, "lr": 0.006131, "mode": "train", "time_backward": 1.062176, "time_data": 0.016896, "time_diff": 1.482973, "time_forward": 0.400488, "time_loss": 0.000358}
[03/27 23:48:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3690", "eta": "1:32:52", "loss": 0.142357, "lr": 0.006148, "mode": "train", "time_backward": 1.058929, "time_data": 0.019471, "time_diff": 1.524709, "time_forward": 0.417642, "time_loss": 0.000283}
[03/27 23:48:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3700", "eta": "1:32:24", "loss": 0.122779, "lr": 0.006164, "mode": "train", "time_backward": 1.087478, "time_data": 0.017510, "time_diff": 1.542875, "time_forward": 0.420211, "time_loss": 0.000386}
[03/27 23:48:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3710", "eta": "1:31:56", "loss": 0.123851, "lr": 0.006180, "mode": "train", "time_backward": 1.056603, "time_data": 0.017134, "time_diff": 1.520604, "time_forward": 0.399966, "time_loss": 0.000278}
[03/27 23:49:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3720", "eta": "1:31:29", "loss": 0.128680, "lr": 0.006197, "mode": "train", "time_backward": 1.161349, "time_data": 0.025476, "time_diff": 1.593415, "time_forward": 0.404253, "time_loss": 0.000630}
[03/27 23:49:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3730", "eta": "1:31:01", "loss": 0.131385, "lr": 0.006213, "mode": "train", "time_backward": 1.058692, "time_data": 0.026862, "time_diff": 1.515992, "time_forward": 0.426758, "time_loss": 0.000415}
[03/27 23:49:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3740", "eta": "1:30:33", "loss": 0.119135, "lr": 0.006229, "mode": "train", "time_backward": 1.144573, "time_data": 0.016985, "time_diff": 1.568270, "time_forward": 0.403978, "time_loss": 0.000257}
[03/27 23:50:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3750", "eta": "1:30:05", "loss": 0.125192, "lr": 0.006246, "mode": "train", "time_backward": 1.055492, "time_data": 0.023260, "time_diff": 1.496996, "time_forward": 0.414660, "time_loss": 0.000269}
[03/27 23:50:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3760", "eta": "1:29:39", "loss": 0.120350, "lr": 0.006262, "mode": "train", "time_backward": 1.115106, "time_data": 0.019120, "time_diff": 1.673593, "time_forward": 0.535715, "time_loss": 0.000269}
[03/27 23:50:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3770", "eta": "1:29:11", "loss": 0.133277, "lr": 0.006278, "mode": "train", "time_backward": 1.054136, "time_data": 0.016897, "time_diff": 1.472058, "time_forward": 0.397313, "time_loss": 0.000234}
[03/27 23:51:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3780", "eta": "1:28:43", "loss": 0.125013, "lr": 0.006294, "mode": "train", "time_backward": 1.057119, "time_data": 0.017016, "time_diff": 1.545235, "time_forward": 0.441682, "time_loss": 0.000259}
[03/27 23:51:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3790", "eta": "1:28:16", "loss": 0.130386, "lr": 0.006311, "mode": "train", "time_backward": 1.068300, "time_data": 0.031326, "time_diff": 1.622619, "time_forward": 0.511385, "time_loss": 0.001558}
[03/27 23:51:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3800", "eta": "1:27:49", "loss": 0.132500, "lr": 0.006327, "mode": "train", "time_backward": 1.074254, "time_data": 0.018294, "time_diff": 1.537001, "time_forward": 0.415449, "time_loss": 0.000321}
[03/27 23:52:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3810", "eta": "1:27:21", "loss": 0.113291, "lr": 0.006343, "mode": "train", "time_backward": 1.057021, "time_data": 0.016913, "time_diff": 1.483049, "time_forward": 0.399370, "time_loss": 0.000247}
[03/27 23:52:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3820", "eta": "1:26:54", "loss": 0.121027, "lr": 0.006360, "mode": "train", "time_backward": 1.054181, "time_data": 0.016799, "time_diff": 1.480379, "time_forward": 0.403571, "time_loss": 0.000399}
[03/27 23:52:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3830", "eta": "1:26:26", "loss": 0.128730, "lr": 0.006376, "mode": "train", "time_backward": 1.065392, "time_data": 0.017107, "time_diff": 1.500028, "time_forward": 0.410546, "time_loss": 0.000394}
[03/27 23:54:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3840", "eta": "1:25:59", "loss": 0.124941, "lr": 0.006392, "mode": "train", "time_backward": 1.141826, "time_data": 0.016898, "time_diff": 1.563938, "time_forward": 0.398776, "time_loss": 0.000240}
[03/27 23:54:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3850", "eta": "1:25:33", "loss": 0.130063, "lr": 0.006409, "mode": "train", "time_backward": 1.129780, "time_data": 0.023159, "time_diff": 1.571981, "time_forward": 0.410133, "time_loss": 0.000580}
[03/27 23:55:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3860", "eta": "1:25:05", "loss": 0.116483, "lr": 0.006425, "mode": "train", "time_backward": 1.072518, "time_data": 0.017875, "time_diff": 1.509281, "time_forward": 0.412602, "time_loss": 0.000459}
[03/27 23:55:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3870", "eta": "1:24:38", "loss": 0.146245, "lr": 0.006441, "mode": "train", "time_backward": 1.072163, "time_data": 0.022841, "time_diff": 1.512757, "time_forward": 0.397851, "time_loss": 0.000261}
[03/27 23:55:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3880", "eta": "1:24:12", "loss": 0.128192, "lr": 0.006458, "mode": "train", "time_backward": 1.065317, "time_data": 0.025817, "time_diff": 1.536563, "time_forward": 0.399860, "time_loss": 0.000386}
[03/27 23:55:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3890", "eta": "1:23:45", "loss": 0.129126, "lr": 0.006474, "mode": "train", "time_backward": 1.120543, "time_data": 0.018733, "time_diff": 1.557311, "time_forward": 0.408178, "time_loss": 0.000323}
[03/27 23:56:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3900", "eta": "1:23:18", "loss": 0.133629, "lr": 0.006490, "mode": "train", "time_backward": 1.072872, "time_data": 0.019818, "time_diff": 1.536544, "time_forward": 0.426907, "time_loss": 0.000306}
[03/27 23:56:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3910", "eta": "1:22:52", "loss": 0.123371, "lr": 0.006507, "mode": "train", "time_backward": 1.170306, "time_data": 0.018343, "time_diff": 1.605368, "time_forward": 0.409714, "time_loss": 0.000257}
[03/27 23:56:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3920", "eta": "1:22:25", "loss": 0.142423, "lr": 0.006523, "mode": "train", "time_backward": 1.090252, "time_data": 0.018844, "time_diff": 1.514303, "time_forward": 0.401459, "time_loss": 0.000408}
[03/27 23:56:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3930", "eta": "1:21:58", "loss": 0.119077, "lr": 0.006539, "mode": "train", "time_backward": 1.057191, "time_data": 0.017027, "time_diff": 1.476826, "time_forward": 0.399198, "time_loss": 0.000234}
[03/27 23:57:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3940", "eta": "1:21:31", "loss": 0.129769, "lr": 0.006556, "mode": "train", "time_backward": 1.057907, "time_data": 0.017185, "time_diff": 1.482002, "time_forward": 0.401102, "time_loss": 0.000406}
[03/27 23:57:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3950", "eta": "1:21:05", "loss": 0.154589, "lr": 0.006572, "mode": "train", "time_backward": 1.057780, "time_data": 0.016878, "time_diff": 1.476569, "time_forward": 0.398512, "time_loss": 0.000221}
[03/27 23:57:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3960", "eta": "1:20:38", "loss": 0.130750, "lr": 0.006588, "mode": "train", "time_backward": 1.063100, "time_data": 0.016977, "time_diff": 1.503432, "time_forward": 0.402276, "time_loss": 0.000354}
[03/27 23:57:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3970", "eta": "1:20:11", "loss": 0.118508, "lr": 0.006605, "mode": "train", "time_backward": 1.056038, "time_data": 0.017015, "time_diff": 1.478277, "time_forward": 0.399691, "time_loss": 0.000248}
[03/27 23:58:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3980", "eta": "1:19:45", "loss": 0.142264, "lr": 0.006621, "mode": "train", "time_backward": 1.129305, "time_data": 0.022398, "time_diff": 1.591179, "time_forward": 0.427788, "time_loss": 0.000342}
[03/27 23:58:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "3990", "eta": "1:19:19", "loss": 0.128786, "lr": 0.006637, "mode": "train", "time_backward": 1.054718, "time_data": 0.017011, "time_diff": 1.476627, "time_forward": 0.399504, "time_loss": 0.000299}
[03/27 23:58:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4000", "eta": "1:18:53", "loss": 0.125989, "lr": 0.006654, "mode": "train", "time_backward": 1.123758, "time_data": 0.016976, "time_diff": 1.551126, "time_forward": 0.406793, "time_loss": 0.000242}
[03/27 23:59:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4010", "eta": "1:18:26", "loss": 0.115783, "lr": 0.006670, "mode": "train", "time_backward": 1.056265, "time_data": 0.017360, "time_diff": 1.480612, "time_forward": 0.403521, "time_loss": 0.000236}
[03/27 23:59:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4020", "eta": "1:18:00", "loss": 0.132690, "lr": 0.006686, "mode": "train", "time_backward": 1.074437, "time_data": 0.017021, "time_diff": 1.518617, "time_forward": 0.424006, "time_loss": 0.000654}
[03/27 23:59:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4030", "eta": "1:17:34", "loss": 0.129009, "lr": 0.006703, "mode": "train", "time_backward": 1.078883, "time_data": 0.017137, "time_diff": 1.500718, "time_forward": 0.401197, "time_loss": 0.000665}
[03/27 23:59:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4040", "eta": "1:17:08", "loss": 0.124051, "lr": 0.006719, "mode": "train", "time_backward": 1.058396, "time_data": 0.020255, "time_diff": 1.511499, "time_forward": 0.398987, "time_loss": 0.000225}
[03/28 00:00:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4050", "eta": "1:16:42", "loss": 0.115297, "lr": 0.006735, "mode": "train", "time_backward": 1.151686, "time_data": 0.019155, "time_diff": 1.581551, "time_forward": 0.407959, "time_loss": 0.000238}
[03/28 00:00:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4060", "eta": "1:16:16", "loss": 0.138648, "lr": 0.006752, "mode": "train", "time_backward": 1.078254, "time_data": 0.018626, "time_diff": 1.520321, "time_forward": 0.419978, "time_loss": 0.000755}
[03/28 00:00:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4070", "eta": "1:15:51", "loss": 0.107130, "lr": 0.006768, "mode": "train", "time_backward": 1.069227, "time_data": 0.017482, "time_diff": 1.561631, "time_forward": 0.467090, "time_loss": 0.000406}
[03/28 00:01:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4080", "eta": "1:15:25", "loss": 0.115333, "lr": 0.006784, "mode": "train", "time_backward": 1.063115, "time_data": 0.025835, "time_diff": 1.513013, "time_forward": 0.420352, "time_loss": 0.000371}
[03/28 00:01:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4090", "eta": "1:15:31", "loss": 0.117154, "lr": 0.006801, "mode": "train", "time_backward": 7.607336, "time_data": 0.017736, "time_diff": 8.034048, "time_forward": 0.404758, "time_loss": 0.000757}
[03/28 00:01:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4100", "eta": "1:15:05", "loss": 0.116722, "lr": 0.006817, "mode": "train", "time_backward": 1.071973, "time_data": 0.017718, "time_diff": 1.502962, "time_forward": 0.398427, "time_loss": 0.000234}
[03/28 00:01:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4110", "eta": "1:14:39", "loss": 0.128618, "lr": 0.006833, "mode": "train", "time_backward": 1.068090, "time_data": 0.016715, "time_diff": 1.543062, "time_forward": 0.444636, "time_loss": 0.000239}
[03/28 00:02:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4120", "eta": "1:14:14", "loss": 0.150128, "lr": 0.006850, "mode": "train", "time_backward": 1.140400, "time_data": 0.023928, "time_diff": 1.583553, "time_forward": 0.416896, "time_loss": 0.000297}
[03/28 00:02:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4130", "eta": "1:13:48", "loss": 0.135821, "lr": 0.006866, "mode": "train", "time_backward": 1.080059, "time_data": 0.016957, "time_diff": 1.501553, "time_forward": 0.398764, "time_loss": 0.000291}
[03/28 00:02:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4140", "eta": "1:13:22", "loss": 0.118951, "lr": 0.006882, "mode": "train", "time_backward": 1.057961, "time_data": 0.036852, "time_diff": 1.504056, "time_forward": 0.404575, "time_loss": 0.000310}
[03/28 00:03:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4150", "eta": "1:12:57", "loss": 0.116585, "lr": 0.006899, "mode": "train", "time_backward": 1.111993, "time_data": 0.019459, "time_diff": 1.719922, "time_forward": 0.579609, "time_loss": 0.000763}
[03/28 00:03:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4160", "eta": "1:12:32", "loss": 0.117898, "lr": 0.006915, "mode": "train", "time_backward": 1.086000, "time_data": 0.017229, "time_diff": 1.534938, "time_forward": 0.398776, "time_loss": 0.000248}
[03/28 00:03:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4170", "eta": "1:12:06", "loss": 0.125036, "lr": 0.006931, "mode": "train", "time_backward": 1.106118, "time_data": 0.016821, "time_diff": 1.531164, "time_forward": 0.399014, "time_loss": 0.000279}
[03/28 00:03:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4180", "eta": "1:11:40", "loss": 0.125180, "lr": 0.006948, "mode": "train", "time_backward": 1.061441, "time_data": 0.017884, "time_diff": 1.490690, "time_forward": 0.403941, "time_loss": 0.000363}
[03/28 00:04:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4190", "eta": "1:11:15", "loss": 0.142456, "lr": 0.006964, "mode": "train", "time_backward": 1.054955, "time_data": 0.017405, "time_diff": 1.476143, "time_forward": 0.399990, "time_loss": 0.000385}
[03/28 00:05:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4200", "eta": "1:10:49", "loss": 0.126330, "lr": 0.006980, "mode": "train", "time_backward": 1.055378, "time_data": 0.017006, "time_diff": 1.476354, "time_forward": 0.398059, "time_loss": 0.000245}
[03/28 00:06:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4210", "eta": "1:10:23", "loss": 0.123286, "lr": 0.006996, "mode": "train", "time_backward": 1.057498, "time_data": 0.017630, "time_diff": 1.481233, "time_forward": 0.400070, "time_loss": 0.000334}
[03/28 00:07:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4220", "eta": "1:09:58", "loss": 0.117081, "lr": 0.007013, "mode": "train", "time_backward": 1.057904, "time_data": 0.016838, "time_diff": 1.480389, "time_forward": 0.398798, "time_loss": 0.000246}
[03/28 00:07:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4230", "eta": "1:09:32", "loss": 0.120265, "lr": 0.007029, "mode": "train", "time_backward": 1.054714, "time_data": 0.016832, "time_diff": 1.474924, "time_forward": 0.399128, "time_loss": 0.000350}
[03/28 00:08:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4240", "eta": "1:09:07", "loss": 0.125278, "lr": 0.007045, "mode": "train", "time_backward": 1.058293, "time_data": 0.017035, "time_diff": 1.480829, "time_forward": 0.401815, "time_loss": 0.000401}
[03/28 00:09:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4250", "eta": "1:08:42", "loss": 0.124143, "lr": 0.007062, "mode": "train", "time_backward": 1.053711, "time_data": 0.017380, "time_diff": 1.475279, "time_forward": 0.398803, "time_loss": 0.000358}
[03/28 00:09:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4260", "eta": "1:08:16", "loss": 0.132850, "lr": 0.007078, "mode": "train", "time_backward": 1.056448, "time_data": 0.023204, "time_diff": 1.483166, "time_forward": 0.400154, "time_loss": 0.000346}
[03/28 00:10:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4270", "eta": "1:07:51", "loss": 0.109972, "lr": 0.007094, "mode": "train", "time_backward": 1.097946, "time_data": 0.016944, "time_diff": 1.517439, "time_forward": 0.399150, "time_loss": 0.000241}
[03/28 00:10:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4280", "eta": "1:07:41", "loss": 0.121306, "lr": 0.007111, "mode": "train", "time_backward": 4.595614, "time_data": 0.016866, "time_diff": 5.020806, "time_forward": 0.401808, "time_loss": 0.000216}
[03/28 00:11:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4290", "eta": "1:07:16", "loss": 0.137414, "lr": 0.007127, "mode": "train", "time_backward": 1.086511, "time_data": 0.016811, "time_diff": 1.510194, "time_forward": 0.398183, "time_loss": 0.000256}
[03/28 00:11:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4300", "eta": "1:06:50", "loss": 0.129319, "lr": 0.007143, "mode": "train", "time_backward": 1.057299, "time_data": 0.016571, "time_diff": 1.476303, "time_forward": 0.399017, "time_loss": 0.000208}
[03/28 00:12:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4310", "eta": "1:06:25", "loss": 0.122378, "lr": 0.007160, "mode": "train", "time_backward": 1.055140, "time_data": 0.016987, "time_diff": 1.476906, "time_forward": 0.399735, "time_loss": 0.000294}
[03/28 00:13:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4320", "eta": "1:06:00", "loss": 0.136543, "lr": 0.007176, "mode": "train", "time_backward": 1.061477, "time_data": 0.016897, "time_diff": 1.480791, "time_forward": 0.398942, "time_loss": 0.000263}
[03/28 00:13:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4330", "eta": "1:05:35", "loss": 0.106897, "lr": 0.007192, "mode": "train", "time_backward": 1.055887, "time_data": 0.016964, "time_diff": 1.478264, "time_forward": 0.398882, "time_loss": 0.000245}
[03/28 00:13:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4340", "eta": "1:05:10", "loss": 0.127570, "lr": 0.007209, "mode": "train", "time_backward": 1.055727, "time_data": 0.016663, "time_diff": 1.493160, "time_forward": 0.417179, "time_loss": 0.000410}
[03/28 00:14:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4350", "eta": "1:04:45", "loss": 0.140028, "lr": 0.007225, "mode": "train", "time_backward": 1.053883, "time_data": 0.016958, "time_diff": 1.476242, "time_forward": 0.398578, "time_loss": 0.000278}
[03/28 00:14:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4360", "eta": "1:04:57", "loss": 0.119972, "lr": 0.007241, "mode": "train", "time_backward": 10.130459, "time_data": 0.016741, "time_diff": 10.628665, "time_forward": 0.400219, "time_loss": 0.000235}
[03/28 00:14:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4370", "eta": "1:04:32", "loss": 0.116783, "lr": 0.007258, "mode": "train", "time_backward": 1.158013, "time_data": 0.020018, "time_diff": 1.614431, "time_forward": 0.409208, "time_loss": 0.000361}
[03/28 00:15:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4380", "eta": "1:04:07", "loss": 0.136827, "lr": 0.007274, "mode": "train", "time_backward": 1.096353, "time_data": 0.017047, "time_diff": 1.522259, "time_forward": 0.402055, "time_loss": 0.000268}
[03/28 00:15:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4390", "eta": "1:03:42", "loss": 0.132532, "lr": 0.007290, "mode": "train", "time_backward": 1.055462, "time_data": 0.016884, "time_diff": 1.476805, "time_forward": 0.399410, "time_loss": 0.000421}
[03/28 00:15:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4400", "eta": "1:03:17", "loss": 0.125556, "lr": 0.007307, "mode": "train", "time_backward": 1.054870, "time_data": 0.017606, "time_diff": 1.483638, "time_forward": 0.399478, "time_loss": 0.000303}
[03/28 00:16:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4410", "eta": "1:02:52", "loss": 0.136566, "lr": 0.007323, "mode": "train", "time_backward": 1.056454, "time_data": 0.017463, "time_diff": 1.477503, "time_forward": 0.400421, "time_loss": 0.000212}
[03/28 00:16:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4420", "eta": "1:02:27", "loss": 0.136029, "lr": 0.007339, "mode": "train", "time_backward": 1.056128, "time_data": 0.024957, "time_diff": 1.533277, "time_forward": 0.445818, "time_loss": 0.000339}
[03/28 00:16:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4430", "eta": "1:02:02", "loss": 0.121786, "lr": 0.007356, "mode": "train", "time_backward": 1.053931, "time_data": 0.016610, "time_diff": 1.479582, "time_forward": 0.397599, "time_loss": 0.000299}
[03/28 00:17:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4440", "eta": "1:01:37", "loss": 0.128928, "lr": 0.007372, "mode": "train", "time_backward": 1.064864, "time_data": 0.016757, "time_diff": 1.486732, "time_forward": 0.401530, "time_loss": 0.000474}
[03/28 00:17:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4450", "eta": "1:01:12", "loss": 0.137900, "lr": 0.007388, "mode": "train", "time_backward": 1.054651, "time_data": 0.016667, "time_diff": 1.472699, "time_forward": 0.397391, "time_loss": 0.000223}
[03/28 00:18:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4460", "eta": "1:00:47", "loss": 0.143643, "lr": 0.007405, "mode": "train", "time_backward": 1.055069, "time_data": 0.017027, "time_diff": 1.477581, "time_forward": 0.398907, "time_loss": 0.000229}
[03/28 00:18:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4470", "eta": "1:00:23", "loss": 0.108317, "lr": 0.007421, "mode": "train", "time_backward": 1.099734, "time_data": 0.016965, "time_diff": 1.523324, "time_forward": 0.399350, "time_loss": 0.000320}
[03/28 00:19:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4480", "eta": "0:59:58", "loss": 0.120114, "lr": 0.007437, "mode": "train", "time_backward": 1.057365, "time_data": 0.018700, "time_diff": 1.478419, "time_forward": 0.399439, "time_loss": 0.000233}
[03/28 00:19:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4490", "eta": "0:59:33", "loss": 0.118980, "lr": 0.007454, "mode": "train", "time_backward": 1.054547, "time_data": 0.017619, "time_diff": 1.515445, "time_forward": 0.439634, "time_loss": 0.000247}
[03/28 00:20:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4500", "eta": "1:00:59", "loss": 0.136421, "lr": 0.007470, "mode": "train", "time_backward": 31.955853, "time_data": 0.016854, "time_diff": 32.402645, "time_forward": 0.397635, "time_loss": 0.000235}
[03/28 00:21:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4510", "eta": "1:00:34", "loss": 0.127968, "lr": 0.007486, "mode": "train", "time_backward": 1.056505, "time_data": 0.016535, "time_diff": 1.474339, "time_forward": 0.397843, "time_loss": 0.000197}
[03/28 00:21:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4520", "eta": "1:00:08", "loss": 0.133885, "lr": 0.007503, "mode": "train", "time_backward": 1.067115, "time_data": 0.018773, "time_diff": 1.491768, "time_forward": 0.398700, "time_loss": 0.000230}
[03/28 00:21:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4530", "eta": "0:59:43", "loss": 0.122261, "lr": 0.007519, "mode": "train", "time_backward": 1.055037, "time_data": 0.016904, "time_diff": 1.482866, "time_forward": 0.403791, "time_loss": 0.000406}
[03/28 00:22:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4540", "eta": "0:59:18", "loss": 0.116594, "lr": 0.007535, "mode": "train", "time_backward": 1.054251, "time_data": 0.018546, "time_diff": 1.476426, "time_forward": 0.399917, "time_loss": 0.000306}
[03/28 00:22:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4550", "eta": "0:58:52", "loss": 0.135140, "lr": 0.007552, "mode": "train", "time_backward": 1.075781, "time_data": 0.017235, "time_diff": 1.499541, "time_forward": 0.398416, "time_loss": 0.000249}
[03/28 00:22:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4560", "eta": "0:58:27", "loss": 0.145914, "lr": 0.007568, "mode": "train", "time_backward": 1.056897, "time_data": 0.018712, "time_diff": 1.479261, "time_forward": 0.398642, "time_loss": 0.000256}
[03/28 00:23:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4570", "eta": "0:58:02", "loss": 0.128184, "lr": 0.007584, "mode": "train", "time_backward": 1.108712, "time_data": 0.017211, "time_diff": 1.534251, "time_forward": 0.402153, "time_loss": 0.000301}
[03/28 00:25:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4580", "eta": "1:02:10", "loss": 0.133173, "lr": 0.007601, "mode": "train", "time_backward": 82.720759, "time_data": 0.016872, "time_diff": 83.323207, "time_forward": 0.399236, "time_loss": 0.000238}
[03/28 00:25:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4590", "eta": "1:01:43", "loss": 0.123643, "lr": 0.007617, "mode": "train", "time_backward": 1.059540, "time_data": 0.017343, "time_diff": 1.481137, "time_forward": 0.398776, "time_loss": 0.000274}
[03/28 00:28:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4600", "eta": "1:01:15", "loss": 0.122114, "lr": 0.007633, "mode": "train", "time_backward": 1.057107, "time_data": 0.017471, "time_diff": 1.480040, "time_forward": 0.398781, "time_loss": 0.000285}
[03/28 00:28:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4610", "eta": "1:00:48", "loss": 0.135438, "lr": 0.007650, "mode": "train", "time_backward": 1.056700, "time_data": 0.017164, "time_diff": 1.481162, "time_forward": 0.399057, "time_loss": 0.000290}
[03/28 00:31:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4620", "eta": "1:00:21", "loss": 0.111812, "lr": 0.007666, "mode": "train", "time_backward": 1.057414, "time_data": 0.017787, "time_diff": 1.482295, "time_forward": 0.402105, "time_loss": 0.000293}
[03/28 00:31:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4630", "eta": "0:59:53", "loss": 0.131831, "lr": 0.007682, "mode": "train", "time_backward": 1.058631, "time_data": 0.017013, "time_diff": 1.488572, "time_forward": 0.399323, "time_loss": 0.000256}
[03/28 00:31:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4640", "eta": "0:59:26", "loss": 0.133037, "lr": 0.007698, "mode": "train", "time_backward": 1.055893, "time_data": 0.016809, "time_diff": 1.476543, "time_forward": 0.399329, "time_loss": 0.000365}
[03/28 00:32:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4650", "eta": "0:58:59", "loss": 0.125520, "lr": 0.007715, "mode": "train", "time_backward": 1.056399, "time_data": 0.017379, "time_diff": 1.477796, "time_forward": 0.399309, "time_loss": 0.000355}
[03/28 00:33:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4660", "eta": "1:00:10", "loss": 0.128760, "lr": 0.007731, "mode": "train", "time_backward": 32.798094, "time_data": 0.016859, "time_diff": 33.213804, "time_forward": 0.398143, "time_loss": 0.000244}
[03/28 00:33:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4670", "eta": "0:59:43", "loss": 0.147703, "lr": 0.007747, "mode": "train", "time_backward": 1.209035, "time_data": 0.016822, "time_diff": 1.801782, "time_forward": 0.575054, "time_loss": 0.000417}
[03/28 00:33:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4680", "eta": "0:59:15", "loss": 0.138510, "lr": 0.007764, "mode": "train", "time_backward": 1.056158, "time_data": 0.016904, "time_diff": 1.479727, "time_forward": 0.399529, "time_loss": 0.000255}
[03/28 00:34:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4690", "eta": "1:00:03", "loss": 0.122494, "lr": 0.007780, "mode": "train", "time_backward": 25.960449, "time_data": 0.016975, "time_diff": 26.382461, "time_forward": 0.398369, "time_loss": 0.000266}
[03/28 00:35:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4700", "eta": "0:59:34", "loss": 0.126280, "lr": 0.007796, "mode": "train", "time_backward": 1.053731, "time_data": 0.017013, "time_diff": 1.475719, "time_forward": 0.397748, "time_loss": 0.000216}
[03/28 00:35:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4710", "eta": "0:59:06", "loss": 0.118532, "lr": 0.007813, "mode": "train", "time_backward": 1.063532, "time_data": 0.017069, "time_diff": 1.528969, "time_forward": 0.445783, "time_loss": 0.000288}
[03/28 00:36:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4720", "eta": "0:58:40", "loss": 0.130470, "lr": 0.007829, "mode": "train", "time_backward": 1.082437, "time_data": 0.056094, "time_diff": 2.111608, "time_forward": 0.953497, "time_loss": 0.000529}
[03/28 00:36:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4730", "eta": "0:58:11", "loss": 0.110834, "lr": 0.007845, "mode": "train", "time_backward": 1.065797, "time_data": 0.016870, "time_diff": 1.486369, "time_forward": 0.398923, "time_loss": 0.000261}
[03/28 00:37:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4740", "eta": "0:58:24", "loss": 0.138789, "lr": 0.007862, "mode": "train", "time_backward": 15.108115, "time_data": 0.016841, "time_diff": 15.535636, "time_forward": 0.399026, "time_loss": 0.000224}
[03/28 00:37:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4750", "eta": "0:57:55", "loss": 0.135698, "lr": 0.007878, "mode": "train", "time_backward": 1.102165, "time_data": 0.016873, "time_diff": 1.525141, "time_forward": 0.398963, "time_loss": 0.000258}
[03/28 00:37:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4760", "eta": "0:57:27", "loss": 0.118839, "lr": 0.007894, "mode": "train", "time_backward": 1.057729, "time_data": 0.020695, "time_diff": 1.485698, "time_forward": 0.398846, "time_loss": 0.000231}
[03/28 00:38:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4770", "eta": "0:56:58", "loss": 0.121810, "lr": 0.007911, "mode": "train", "time_backward": 1.111637, "time_data": 0.017239, "time_diff": 1.561944, "time_forward": 0.399792, "time_loss": 0.000344}
[03/28 00:38:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4780", "eta": "0:56:30", "loss": 0.109700, "lr": 0.007927, "mode": "train", "time_backward": 1.055847, "time_data": 0.017559, "time_diff": 1.481307, "time_forward": 0.398814, "time_loss": 0.000299}
[03/28 00:39:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4790", "eta": "0:56:02", "loss": 0.132492, "lr": 0.007943, "mode": "train", "time_backward": 1.167547, "time_data": 0.016792, "time_diff": 1.589893, "time_forward": 0.399366, "time_loss": 0.000265}
[03/28 00:39:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4800", "eta": "0:55:33", "loss": 0.129026, "lr": 0.007960, "mode": "train", "time_backward": 1.060649, "time_data": 0.021204, "time_diff": 1.526376, "time_forward": 0.442288, "time_loss": 0.000363}
[03/28 00:40:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4810", "eta": "0:55:05", "loss": 0.126586, "lr": 0.007976, "mode": "train", "time_backward": 1.058726, "time_data": 0.019296, "time_diff": 1.481074, "time_forward": 0.398978, "time_loss": 0.000365}
[03/28 00:40:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4820", "eta": "0:55:49", "loss": 0.113927, "lr": 0.007992, "mode": "train", "time_backward": 28.128902, "time_data": 0.016866, "time_diff": 28.576580, "time_forward": 0.399843, "time_loss": 0.000284}
[03/28 00:41:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4830", "eta": "0:55:21", "loss": 0.121937, "lr": 0.008009, "mode": "train", "time_backward": 1.089050, "time_data": 0.017423, "time_diff": 1.698262, "time_forward": 0.586800, "time_loss": 0.000303}
[03/28 00:41:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4840", "eta": "0:54:52", "loss": 0.125952, "lr": 0.008025, "mode": "train", "time_backward": 1.056304, "time_data": 0.018268, "time_diff": 1.478710, "time_forward": 0.398942, "time_loss": 0.000241}
[03/28 00:41:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4850", "eta": "0:54:23", "loss": 0.134683, "lr": 0.008041, "mode": "train", "time_backward": 1.057097, "time_data": 0.017468, "time_diff": 1.477593, "time_forward": 0.398533, "time_loss": 0.000250}
[03/28 00:42:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4860", "eta": "0:53:55", "loss": 0.110871, "lr": 0.008058, "mode": "train", "time_backward": 1.093079, "time_data": 0.017051, "time_diff": 1.522791, "time_forward": 0.399635, "time_loss": 0.000249}
[03/28 00:42:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4870", "eta": "0:53:26", "loss": 0.136750, "lr": 0.008074, "mode": "train", "time_backward": 1.052083, "time_data": 0.017560, "time_diff": 1.474552, "time_forward": 0.397970, "time_loss": 0.000228}
[03/28 00:43:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4880", "eta": "0:52:57", "loss": 0.120002, "lr": 0.008090, "mode": "train", "time_backward": 1.054981, "time_data": 0.018665, "time_diff": 1.476749, "time_forward": 0.398133, "time_loss": 0.000287}
[03/28 00:43:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4890", "eta": "0:52:29", "loss": 0.125013, "lr": 0.008107, "mode": "train", "time_backward": 1.067327, "time_data": 0.016789, "time_diff": 1.486534, "time_forward": 0.398935, "time_loss": 0.000243}
[03/28 00:44:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4900", "eta": "0:52:01", "loss": 0.124737, "lr": 0.008123, "mode": "train", "time_backward": 1.052716, "time_data": 0.016995, "time_diff": 1.537021, "time_forward": 0.399517, "time_loss": 0.000320}
[03/28 00:44:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4910", "eta": "0:51:32", "loss": 0.133631, "lr": 0.008139, "mode": "train", "time_backward": 1.065792, "time_data": 0.017297, "time_diff": 1.495033, "time_forward": 0.405321, "time_loss": 0.000920}
[03/28 00:44:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4920", "eta": "0:51:04", "loss": 0.128975, "lr": 0.008156, "mode": "train", "time_backward": 1.123170, "time_data": 0.020800, "time_diff": 1.564039, "time_forward": 0.399539, "time_loss": 0.000338}
[03/28 00:45:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4930", "eta": "0:50:36", "loss": 0.136504, "lr": 0.008172, "mode": "train", "time_backward": 1.067901, "time_data": 0.016965, "time_diff": 1.503244, "time_forward": 0.399668, "time_loss": 0.000312}
[03/28 00:45:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4940", "eta": "0:50:08", "loss": 0.123387, "lr": 0.008188, "mode": "train", "time_backward": 1.084181, "time_data": 0.018298, "time_diff": 1.550823, "time_forward": 0.397634, "time_loss": 0.000268}
[03/28 00:45:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4950", "eta": "0:49:39", "loss": 0.120849, "lr": 0.008205, "mode": "train", "time_backward": 1.134809, "time_data": 0.016962, "time_diff": 1.564448, "time_forward": 0.408998, "time_loss": 0.000297}
[03/28 00:46:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4960", "eta": "0:49:11", "loss": 0.121509, "lr": 0.008221, "mode": "train", "time_backward": 1.102962, "time_data": 0.018250, "time_diff": 1.528798, "time_forward": 0.404085, "time_loss": 0.000512}
[03/28 00:46:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4970", "eta": "0:48:43", "loss": 0.126224, "lr": 0.008237, "mode": "train", "time_backward": 1.055682, "time_data": 0.017608, "time_diff": 1.481340, "time_forward": 0.397834, "time_loss": 0.000289}
[03/28 00:46:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4980", "eta": "0:48:15", "loss": 0.133229, "lr": 0.008254, "mode": "train", "time_backward": 1.055791, "time_data": 0.017265, "time_diff": 1.476525, "time_forward": 0.399658, "time_loss": 0.000287}
[03/28 00:47:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "4990", "eta": "0:47:47", "loss": 0.138805, "lr": 0.008270, "mode": "train", "time_backward": 1.055615, "time_data": 0.018021, "time_diff": 1.513697, "time_forward": 0.398582, "time_loss": 0.000345}
[03/28 00:47:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5000", "eta": "0:47:19", "loss": 0.124213, "lr": 0.008286, "mode": "train", "time_backward": 1.055896, "time_data": 0.017014, "time_diff": 1.479005, "time_forward": 0.399123, "time_loss": 0.000273}
[03/28 00:48:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5010", "eta": "0:46:54", "loss": 0.146909, "lr": 0.008303, "mode": "train", "time_backward": 1.096077, "time_data": 0.027160, "time_diff": 1.566397, "time_forward": 0.421718, "time_loss": 0.000282}
[03/28 00:48:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5020", "eta": "0:46:28", "loss": 0.114940, "lr": 0.008319, "mode": "train", "time_backward": 1.111679, "time_data": 0.017185, "time_diff": 1.531698, "time_forward": 0.399629, "time_loss": 0.000292}
[03/28 00:49:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5030", "eta": "0:46:03", "loss": 0.121433, "lr": 0.008335, "mode": "train", "time_backward": 1.058940, "time_data": 0.019993, "time_diff": 1.483145, "time_forward": 0.400640, "time_loss": 0.000434}
[03/28 00:49:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5040", "eta": "0:45:37", "loss": 0.122210, "lr": 0.008352, "mode": "train", "time_backward": 1.061217, "time_data": 0.026168, "time_diff": 1.560164, "time_forward": 0.452702, "time_loss": 0.000282}
[03/28 00:49:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5050", "eta": "0:45:12", "loss": 0.130261, "lr": 0.008368, "mode": "train", "time_backward": 1.102896, "time_data": 0.017115, "time_diff": 1.527692, "time_forward": 0.399982, "time_loss": 0.000484}
[03/28 00:49:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5060", "eta": "0:44:46", "loss": 0.124055, "lr": 0.008384, "mode": "train", "time_backward": 1.092842, "time_data": 0.018987, "time_diff": 1.521474, "time_forward": 0.399044, "time_loss": 0.000271}
[03/28 00:50:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5070", "eta": "0:44:21", "loss": 0.116371, "lr": 0.008400, "mode": "train", "time_backward": 1.054749, "time_data": 0.017888, "time_diff": 1.516513, "time_forward": 0.436898, "time_loss": 0.000255}
[03/28 00:50:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5080", "eta": "0:43:56", "loss": 0.134692, "lr": 0.008417, "mode": "train", "time_backward": 1.105172, "time_data": 0.021114, "time_diff": 1.565306, "time_forward": 0.434653, "time_loss": 0.000313}
[03/28 00:50:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5090", "eta": "0:43:30", "loss": 0.117408, "lr": 0.008433, "mode": "train", "time_backward": 1.102179, "time_data": 0.027242, "time_diff": 1.537074, "time_forward": 0.404295, "time_loss": 0.000300}
[03/28 00:50:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5100", "eta": "0:43:05", "loss": 0.120844, "lr": 0.008449, "mode": "train", "time_backward": 1.070965, "time_data": 0.032543, "time_diff": 1.530053, "time_forward": 0.420910, "time_loss": 0.000446}
[03/28 00:51:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5110", "eta": "0:42:39", "loss": 0.123432, "lr": 0.008466, "mode": "train", "time_backward": 1.072537, "time_data": 0.016797, "time_diff": 1.494936, "time_forward": 0.402778, "time_loss": 0.000305}
[03/28 00:51:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5120", "eta": "0:42:14", "loss": 0.127802, "lr": 0.008482, "mode": "train", "time_backward": 1.101703, "time_data": 0.020744, "time_diff": 1.588379, "time_forward": 0.402773, "time_loss": 0.000607}
[03/28 00:51:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5130", "eta": "0:41:49", "loss": 0.126758, "lr": 0.008498, "mode": "train", "time_backward": 1.197407, "time_data": 0.017330, "time_diff": 1.656818, "time_forward": 0.438434, "time_loss": 0.000246}
[03/28 00:52:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5140", "eta": "0:41:23", "loss": 0.131715, "lr": 0.008515, "mode": "train", "time_backward": 1.182987, "time_data": 0.035882, "time_diff": 1.666263, "time_forward": 0.443899, "time_loss": 0.000260}
[03/28 00:52:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5150", "eta": "0:40:58", "loss": 0.118371, "lr": 0.008531, "mode": "train", "time_backward": 1.147191, "time_data": 0.017984, "time_diff": 1.587795, "time_forward": 0.410936, "time_loss": 0.000236}
[03/28 00:52:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5160", "eta": "0:40:33", "loss": 0.122578, "lr": 0.008547, "mode": "train", "time_backward": 1.089392, "time_data": 0.017566, "time_diff": 1.599977, "time_forward": 0.404080, "time_loss": 0.000474}
[03/28 00:52:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5170", "eta": "0:40:06", "loss": 0.120757, "lr": 0.008564, "mode": "train", "time_backward": 1.108186, "time_data": 0.024775, "time_diff": 1.540292, "time_forward": 0.406298, "time_loss": 0.000700}
[03/28 00:53:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5180", "eta": "0:39:41", "loss": 0.121224, "lr": 0.008580, "mode": "train", "time_backward": 1.066197, "time_data": 0.017445, "time_diff": 1.491834, "time_forward": 0.400699, "time_loss": 0.000252}
[03/28 00:53:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5190", "eta": "0:39:15", "loss": 0.138889, "lr": 0.008596, "mode": "train", "time_backward": 1.075899, "time_data": 0.018964, "time_diff": 1.526429, "time_forward": 0.399147, "time_loss": 0.000337}
[03/28 00:53:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5200", "eta": "0:38:50", "loss": 0.123795, "lr": 0.008613, "mode": "train", "time_backward": 1.071060, "time_data": 0.039037, "time_diff": 1.526279, "time_forward": 0.405607, "time_loss": 0.000227}
[03/28 00:53:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5210", "eta": "0:38:24", "loss": 0.136900, "lr": 0.008629, "mode": "train", "time_backward": 1.057417, "time_data": 0.023745, "time_diff": 1.491861, "time_forward": 0.403364, "time_loss": 0.000408}
[03/28 00:54:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5220", "eta": "0:37:59", "loss": 0.132655, "lr": 0.008645, "mode": "train", "time_backward": 1.057705, "time_data": 0.016973, "time_diff": 1.480315, "time_forward": 0.399187, "time_loss": 0.000399}
[03/28 00:54:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5230", "eta": "0:37:33", "loss": 0.131547, "lr": 0.008662, "mode": "train", "time_backward": 1.133593, "time_data": 0.017051, "time_diff": 1.614837, "time_forward": 0.399144, "time_loss": 0.000245}
[03/28 00:54:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5240", "eta": "0:37:08", "loss": 0.119175, "lr": 0.008678, "mode": "train", "time_backward": 1.056175, "time_data": 0.017076, "time_diff": 1.482802, "time_forward": 0.398820, "time_loss": 0.000234}
[03/28 00:55:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5250", "eta": "0:36:42", "loss": 0.116648, "lr": 0.008694, "mode": "train", "time_backward": 1.084727, "time_data": 0.021097, "time_diff": 1.528819, "time_forward": 0.414501, "time_loss": 0.000469}
[03/28 00:56:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5260", "eta": "0:36:17", "loss": 0.128614, "lr": 0.008711, "mode": "train", "time_backward": 1.056987, "time_data": 0.017173, "time_diff": 1.484319, "time_forward": 0.399624, "time_loss": 0.000404}
[03/28 00:56:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5270", "eta": "0:35:51", "loss": 0.113173, "lr": 0.008727, "mode": "train", "time_backward": 1.066651, "time_data": 0.016914, "time_diff": 1.516358, "time_forward": 0.399873, "time_loss": 0.000258}
[03/28 00:57:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5280", "eta": "0:34:55", "loss": 0.127941, "lr": 0.008743, "mode": "train", "time_backward": 1.055839, "time_data": 0.016838, "time_diff": 1.479004, "time_forward": 0.399372, "time_loss": 0.000259}
[03/28 00:57:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5290", "eta": "0:34:31", "loss": 0.118360, "lr": 0.008760, "mode": "train", "time_backward": 1.186878, "time_data": 0.031595, "time_diff": 1.638362, "time_forward": 0.416506, "time_loss": 0.000266}
[03/28 00:57:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5300", "eta": "0:34:05", "loss": 0.118432, "lr": 0.008776, "mode": "train", "time_backward": 1.059796, "time_data": 0.016810, "time_diff": 1.483653, "time_forward": 0.399146, "time_loss": 0.000262}
[03/28 00:58:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5310", "eta": "0:33:40", "loss": 0.122055, "lr": 0.008792, "mode": "train", "time_backward": 1.174801, "time_data": 0.021828, "time_diff": 1.726863, "time_forward": 0.399109, "time_loss": 0.000292}
[03/28 00:58:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5320", "eta": "0:33:21", "loss": 0.109870, "lr": 0.008809, "mode": "train", "time_backward": 1.057481, "time_data": 3.846368, "time_diff": 5.368830, "time_forward": 0.412161, "time_loss": 0.000331}
[03/28 00:58:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5330", "eta": "0:32:40", "loss": 0.126539, "lr": 0.008825, "mode": "train", "time_backward": 1.095064, "time_data": 0.017162, "time_diff": 1.518545, "time_forward": 0.398856, "time_loss": 0.000242}
[03/28 00:58:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5340", "eta": "0:31:22", "loss": 0.107802, "lr": 0.008841, "mode": "train", "time_backward": 1.094883, "time_data": 0.017874, "time_diff": 1.567085, "time_forward": 0.432877, "time_loss": 0.000354}
[03/28 00:59:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5350", "eta": "0:30:57", "loss": 0.110984, "lr": 0.008858, "mode": "train", "time_backward": 1.115217, "time_data": 0.017459, "time_diff": 1.570171, "time_forward": 0.433520, "time_loss": 0.000671}
[03/28 00:59:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5360", "eta": "0:30:33", "loss": 0.116629, "lr": 0.008874, "mode": "train", "time_backward": 1.103855, "time_data": 0.021527, "time_diff": 1.559257, "time_forward": 0.424440, "time_loss": 0.000357}
[03/28 00:59:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5370", "eta": "0:29:40", "loss": 0.118301, "lr": 0.008890, "mode": "train", "time_backward": 1.117543, "time_data": 0.017010, "time_diff": 1.540296, "time_forward": 0.398641, "time_loss": 0.000314}
[03/28 01:00:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5380", "eta": "0:29:16", "loss": 0.122062, "lr": 0.008907, "mode": "train", "time_backward": 1.053931, "time_data": 0.017215, "time_diff": 1.482527, "time_forward": 0.406236, "time_loss": 0.000469}
[03/28 01:00:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5390", "eta": "0:28:52", "loss": 0.136245, "lr": 0.008923, "mode": "train", "time_backward": 1.086162, "time_data": 0.017206, "time_diff": 1.678970, "time_forward": 0.569234, "time_loss": 0.000362}
[03/28 01:00:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5400", "eta": "0:28:28", "loss": 0.109376, "lr": 0.008939, "mode": "train", "time_backward": 1.060265, "time_data": 0.017034, "time_diff": 1.499207, "time_forward": 0.399773, "time_loss": 0.000614}
[03/28 01:00:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5410", "eta": "0:27:37", "loss": 0.127235, "lr": 0.008956, "mode": "train", "time_backward": 1.057087, "time_data": 0.017067, "time_diff": 1.477193, "time_forward": 0.399198, "time_loss": 0.000389}
[03/28 01:01:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5420", "eta": "0:27:10", "loss": 0.141172, "lr": 0.008972, "mode": "train", "time_backward": 1.090492, "time_data": 0.017315, "time_diff": 1.515993, "time_forward": 0.401115, "time_loss": 0.000332}
[03/28 01:01:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5430", "eta": "0:27:02", "loss": 0.129637, "lr": 0.008988, "mode": "train", "time_backward": 1.592492, "time_data": 9.624552, "time_diff": 12.749053, "time_forward": 1.348826, "time_loss": 0.107176}
[03/28 01:02:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5440", "eta": "0:26:38", "loss": 0.111735, "lr": 0.009005, "mode": "train", "time_backward": 1.082846, "time_data": 0.017165, "time_diff": 1.509839, "time_forward": 0.401820, "time_loss": 0.000234}
[03/28 01:02:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5450", "eta": "0:26:04", "loss": 0.116078, "lr": 0.009021, "mode": "train", "time_backward": 1.225251, "time_data": 0.029929, "time_diff": 1.661547, "time_forward": 0.398815, "time_loss": 0.000222}
[03/28 01:02:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5460", "eta": "0:25:40", "loss": 0.119221, "lr": 0.009037, "mode": "train", "time_backward": 1.099326, "time_data": 0.027701, "time_diff": 1.531594, "time_forward": 0.397624, "time_loss": 0.000261}
[03/28 01:03:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5470", "eta": "0:25:16", "loss": 0.108736, "lr": 0.009054, "mode": "train", "time_backward": 1.055978, "time_data": 0.017142, "time_diff": 1.486081, "time_forward": 0.399775, "time_loss": 0.000289}
[03/28 01:03:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5480", "eta": "0:24:53", "loss": 0.117446, "lr": 0.009070, "mode": "train", "time_backward": 1.054645, "time_data": 0.016691, "time_diff": 1.476986, "time_forward": 0.399004, "time_loss": 0.000312}
[03/28 01:04:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5490", "eta": "0:23:40", "loss": 0.146097, "lr": 0.009086, "mode": "train", "time_backward": 1.062369, "time_data": 0.017236, "time_diff": 1.531152, "time_forward": 0.444224, "time_loss": 0.000375}
[03/28 01:04:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5500", "eta": "0:23:17", "loss": 0.120937, "lr": 0.009103, "mode": "train", "time_backward": 1.062778, "time_data": 0.016938, "time_diff": 1.485961, "time_forward": 0.398306, "time_loss": 0.000252}
[03/28 01:04:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5510", "eta": "0:22:54", "loss": 0.109114, "lr": 0.009119, "mode": "train", "time_backward": 1.062892, "time_data": 0.017221, "time_diff": 1.483625, "time_forward": 0.400018, "time_loss": 0.000261}
[03/28 01:06:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5520", "eta": "0:22:23", "loss": 0.131813, "lr": 0.009135, "mode": "train", "time_backward": 1.058391, "time_data": 0.017074, "time_diff": 1.500054, "time_forward": 0.399428, "time_loss": 0.000260}
[03/28 01:06:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5530", "eta": "0:22:00", "loss": 0.118328, "lr": 0.009151, "mode": "train", "time_backward": 1.053462, "time_data": 0.016778, "time_diff": 1.473576, "time_forward": 0.397371, "time_loss": 0.000216}
[03/28 01:07:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5540", "eta": "0:21:38", "loss": 0.140198, "lr": 0.009168, "mode": "train", "time_backward": 1.059878, "time_data": 0.017006, "time_diff": 1.478279, "time_forward": 0.397977, "time_loss": 0.000239}
[03/28 01:11:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5550", "eta": "0:21:15", "loss": 0.124336, "lr": 0.009184, "mode": "train", "time_backward": 1.062494, "time_data": 0.017412, "time_diff": 1.482107, "time_forward": 0.398982, "time_loss": 0.000241}
[03/28 01:11:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5560", "eta": "0:20:52", "loss": 0.119791, "lr": 0.009200, "mode": "train", "time_backward": 1.056894, "time_data": 0.017149, "time_diff": 1.479111, "time_forward": 0.398183, "time_loss": 0.000587}
[03/28 01:17:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5570", "eta": "0:20:19", "loss": 0.115117, "lr": 0.009217, "mode": "train", "time_backward": 1.055469, "time_data": 0.016789, "time_diff": 1.475929, "time_forward": 0.398775, "time_loss": 0.000247}
[03/28 01:17:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5580", "eta": "0:19:56", "loss": 0.114446, "lr": 0.009233, "mode": "train", "time_backward": 1.099603, "time_data": 0.016924, "time_diff": 1.523031, "time_forward": 0.399269, "time_loss": 0.000233}
[03/28 01:18:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5590", "eta": "0:19:33", "loss": 0.107314, "lr": 0.009249, "mode": "train", "time_backward": 1.091584, "time_data": 0.017271, "time_diff": 1.520400, "time_forward": 0.405323, "time_loss": 0.000244}
[03/28 01:18:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5600", "eta": "0:19:11", "loss": 0.119979, "lr": 0.009266, "mode": "train", "time_backward": 1.099719, "time_data": 0.019139, "time_diff": 1.570199, "time_forward": 0.399296, "time_loss": 0.000354}
[03/28 01:18:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5610", "eta": "0:18:36", "loss": 0.116878, "lr": 0.009282, "mode": "train", "time_backward": 1.176818, "time_data": 0.017770, "time_diff": 1.606660, "time_forward": 0.405514, "time_loss": 0.000366}
[03/28 01:19:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5620", "eta": "0:18:13", "loss": 0.124225, "lr": 0.009298, "mode": "train", "time_backward": 1.056032, "time_data": 0.019169, "time_diff": 1.483245, "time_forward": 0.398945, "time_loss": 0.000192}
[03/28 01:20:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5630", "eta": "0:18:02", "loss": 0.140693, "lr": 0.009315, "mode": "train", "time_backward": 1.063514, "time_data": 11.781284, "time_diff": 13.260506, "time_forward": 0.409389, "time_loss": 0.000345}
[03/28 01:20:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5640", "eta": "0:17:40", "loss": 0.128257, "lr": 0.009331, "mode": "train", "time_backward": 1.057229, "time_data": 0.017616, "time_diff": 1.478594, "time_forward": 0.398481, "time_loss": 0.000242}
[03/28 01:20:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5650", "eta": "0:17:15", "loss": 0.135422, "lr": 0.009347, "mode": "train", "time_backward": 1.093058, "time_data": 0.018333, "time_diff": 1.534237, "time_forward": 0.399254, "time_loss": 0.000397}
[03/28 01:21:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5660", "eta": "0:16:52", "loss": 0.129008, "lr": 0.009364, "mode": "train", "time_backward": 1.104930, "time_data": 0.039756, "time_diff": 1.604310, "time_forward": 0.439223, "time_loss": 0.000404}
[03/28 01:21:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5670", "eta": "0:16:30", "loss": 0.126221, "lr": 0.009380, "mode": "train", "time_backward": 1.086301, "time_data": 0.017205, "time_diff": 1.518150, "time_forward": 0.397759, "time_loss": 0.000262}
[03/28 01:21:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5680", "eta": "0:16:07", "loss": 0.130196, "lr": 0.009396, "mode": "train", "time_backward": 1.129921, "time_data": 0.034039, "time_diff": 1.621343, "time_forward": 0.414166, "time_loss": 0.000279}
[03/28 01:21:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5690", "eta": "0:15:45", "loss": 0.125454, "lr": 0.009413, "mode": "train", "time_backward": 1.110630, "time_data": 0.017736, "time_diff": 1.545902, "time_forward": 0.398464, "time_loss": 0.000312}
[03/28 01:22:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5700", "eta": "0:15:23", "loss": 0.124693, "lr": 0.009429, "mode": "train", "time_backward": 1.068607, "time_data": 0.025107, "time_diff": 1.609299, "time_forward": 0.454902, "time_loss": 0.000342}
[03/28 01:22:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5710", "eta": "0:15:00", "loss": 0.114841, "lr": 0.009445, "mode": "train", "time_backward": 1.054593, "time_data": 0.016999, "time_diff": 1.571956, "time_forward": 0.462799, "time_loss": 0.000482}
[03/28 01:22:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5720", "eta": "0:14:38", "loss": 0.125965, "lr": 0.009462, "mode": "train", "time_backward": 1.108688, "time_data": 0.019662, "time_diff": 1.536170, "time_forward": 0.399555, "time_loss": 0.000328}
[03/28 01:22:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5730", "eta": "0:14:15", "loss": 0.122970, "lr": 0.009478, "mode": "train", "time_backward": 1.138021, "time_data": 0.018400, "time_diff": 1.558263, "time_forward": 0.399808, "time_loss": 0.000331}
[03/28 01:23:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5740", "eta": "0:13:53", "loss": 0.137414, "lr": 0.009494, "mode": "train", "time_backward": 1.139324, "time_data": 0.022609, "time_diff": 1.573825, "time_forward": 0.408164, "time_loss": 0.000352}
[03/28 01:23:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5750", "eta": "0:13:31", "loss": 0.111174, "lr": 0.009511, "mode": "train", "time_backward": 1.053797, "time_data": 0.023502, "time_diff": 1.563699, "time_forward": 0.482809, "time_loss": 0.000241}
[03/28 01:23:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5760", "eta": "0:13:08", "loss": 0.118843, "lr": 0.009527, "mode": "train", "time_backward": 1.149426, "time_data": 0.020713, "time_diff": 1.576198, "time_forward": 0.398610, "time_loss": 0.000265}
[03/28 01:23:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5770", "eta": "0:12:41", "loss": 0.129297, "lr": 0.009543, "mode": "train", "time_backward": 1.055362, "time_data": 0.016750, "time_diff": 1.478669, "time_forward": 0.398539, "time_loss": 0.000342}
[03/28 01:24:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5780", "eta": "0:12:19", "loss": 0.129842, "lr": 0.009560, "mode": "train", "time_backward": 1.062982, "time_data": 0.020621, "time_diff": 1.498265, "time_forward": 0.409777, "time_loss": 0.000382}
[03/28 01:24:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5790", "eta": "0:11:56", "loss": 0.108724, "lr": 0.009576, "mode": "train", "time_backward": 1.080562, "time_data": 0.018005, "time_diff": 1.502700, "time_forward": 0.399768, "time_loss": 0.000408}
[03/28 01:24:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5800", "eta": "0:11:34", "loss": 0.128009, "lr": 0.009592, "mode": "train", "time_backward": 1.054453, "time_data": 0.017793, "time_diff": 1.477126, "time_forward": 0.398499, "time_loss": 0.000256}
[03/28 01:25:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5810", "eta": "0:11:11", "loss": 0.137648, "lr": 0.009609, "mode": "train", "time_backward": 1.325717, "time_data": 0.021209, "time_diff": 1.778682, "time_forward": 0.423107, "time_loss": 0.000752}
[03/28 01:25:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5820", "eta": "0:10:49", "loss": 0.115339, "lr": 0.009625, "mode": "train", "time_backward": 1.061385, "time_data": 0.019277, "time_diff": 1.504936, "time_forward": 0.409231, "time_loss": 0.000281}
[03/28 01:25:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5830", "eta": "0:10:26", "loss": 0.118975, "lr": 0.009641, "mode": "train", "time_backward": 1.111216, "time_data": 0.019681, "time_diff": 1.688731, "time_forward": 0.557224, "time_loss": 0.000273}
[03/28 01:26:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5840", "eta": "0:10:04", "loss": 0.133714, "lr": 0.009658, "mode": "train", "time_backward": 1.090448, "time_data": 0.021014, "time_diff": 1.563047, "time_forward": 0.401552, "time_loss": 0.000647}
[03/28 01:26:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5850", "eta": "0:09:42", "loss": 0.110017, "lr": 0.009674, "mode": "train", "time_backward": 1.087207, "time_data": 0.022344, "time_diff": 1.576753, "time_forward": 0.451571, "time_loss": 0.000611}
[03/28 01:26:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5860", "eta": "0:09:20", "loss": 0.125330, "lr": 0.009690, "mode": "train", "time_backward": 1.239350, "time_data": 0.054891, "time_diff": 2.242820, "time_forward": 0.919725, "time_loss": 0.000624}
[03/28 01:26:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5870", "eta": "0:08:58", "loss": 0.113248, "lr": 0.009707, "mode": "train", "time_backward": 1.113288, "time_data": 0.017573, "time_diff": 1.541067, "time_forward": 0.399899, "time_loss": 0.000406}
[03/28 01:27:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5880", "eta": "0:08:35", "loss": 0.112876, "lr": 0.009723, "mode": "train", "time_backward": 1.079705, "time_data": 0.017931, "time_diff": 1.602264, "time_forward": 0.479985, "time_loss": 0.000288}
[03/28 01:27:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5890", "eta": "0:08:10", "loss": 0.099267, "lr": 0.009739, "mode": "train", "time_backward": 1.062635, "time_data": 0.017935, "time_diff": 1.486592, "time_forward": 0.401859, "time_loss": 0.000326}
[03/28 01:27:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5900", "eta": "0:07:48", "loss": 0.117990, "lr": 0.009756, "mode": "train", "time_backward": 1.060845, "time_data": 0.020089, "time_diff": 1.624945, "time_forward": 0.535290, "time_loss": 0.000273}
[03/28 01:28:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5910", "eta": "0:07:25", "loss": 0.121134, "lr": 0.009772, "mode": "train", "time_backward": 1.105970, "time_data": 0.018177, "time_diff": 1.531460, "time_forward": 0.399387, "time_loss": 0.000366}
[03/28 01:28:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5920", "eta": "0:07:03", "loss": 0.124518, "lr": 0.009788, "mode": "train", "time_backward": 1.176004, "time_data": 0.017327, "time_diff": 1.960310, "time_forward": 0.662481, "time_loss": 0.016422}
[03/28 01:28:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5930", "eta": "0:06:41", "loss": 0.103498, "lr": 0.009805, "mode": "train", "time_backward": 1.101943, "time_data": 0.019520, "time_diff": 1.548588, "time_forward": 0.398813, "time_loss": 0.000256}
[03/28 01:28:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5940", "eta": "0:06:19", "loss": 0.112648, "lr": 0.009821, "mode": "train", "time_backward": 1.054738, "time_data": 0.016951, "time_diff": 1.475212, "time_forward": 0.397670, "time_loss": 0.000248}
[03/28 01:29:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5950", "eta": "0:05:55", "loss": 0.114619, "lr": 0.009837, "mode": "train", "time_backward": 1.058867, "time_data": 0.065343, "time_diff": 1.575301, "time_forward": 0.447450, "time_loss": 0.000383}
[03/28 01:30:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5960", "eta": "0:05:33", "loss": 0.126610, "lr": 0.009853, "mode": "train", "time_backward": 1.070698, "time_data": 0.018463, "time_diff": 1.518459, "time_forward": 0.418126, "time_loss": 0.000276}
[03/28 01:30:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5970", "eta": "0:05:11", "loss": 0.127423, "lr": 0.009870, "mode": "train", "time_backward": 1.065092, "time_data": 0.023509, "time_diff": 1.540667, "time_forward": 0.430730, "time_loss": 0.001081}
[03/28 01:30:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5980", "eta": "0:04:49", "loss": 0.132721, "lr": 0.009886, "mode": "train", "time_backward": 1.113678, "time_data": 0.019517, "time_diff": 1.539584, "time_forward": 0.402888, "time_loss": 0.000269}
[03/28 01:31:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "5990", "eta": "0:04:27", "loss": 0.121789, "lr": 0.009902, "mode": "train", "time_backward": 1.158577, "time_data": 0.017613, "time_diff": 1.600299, "time_forward": 0.420561, "time_loss": 0.000289}
[03/28 01:31:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6000", "eta": "0:04:05", "loss": 0.115179, "lr": 0.009919, "mode": "train", "time_backward": 1.082487, "time_data": 0.019265, "time_diff": 1.514428, "time_forward": 0.402900, "time_loss": 0.000347}
[03/28 01:31:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6010", "eta": "0:03:43", "loss": 0.100979, "lr": 0.009935, "mode": "train", "time_backward": 1.057229, "time_data": 0.019047, "time_diff": 1.480858, "time_forward": 0.400822, "time_loss": 0.000306}
[03/28 01:31:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6020", "eta": "0:03:21", "loss": 0.111269, "lr": 0.009951, "mode": "train", "time_backward": 1.210028, "time_data": 0.018155, "time_diff": 1.649989, "time_forward": 0.398883, "time_loss": 0.000332}
[03/28 01:32:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6030", "eta": "0:02:59", "loss": 0.121194, "lr": 0.009968, "mode": "train", "time_backward": 1.088449, "time_data": 0.022479, "time_diff": 1.538458, "time_forward": 0.424613, "time_loss": 0.000382}
[03/28 01:32:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6040", "eta": "0:02:36", "loss": 0.108200, "lr": 0.009984, "mode": "train", "time_backward": 1.060879, "time_data": 0.017677, "time_diff": 1.486340, "time_forward": 0.399899, "time_loss": 0.000273}
[03/28 01:32:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6050", "eta": "0:02:14", "loss": 0.128907, "lr": 0.010000, "mode": "train", "time_backward": 1.062620, "time_data": 0.017230, "time_diff": 1.493787, "time_forward": 0.402016, "time_loss": 0.000364}
[03/28 01:32:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6060", "eta": "0:01:52", "loss": 0.123625, "lr": 0.010017, "mode": "train", "time_backward": 1.094207, "time_data": 0.017164, "time_diff": 1.518996, "time_forward": 0.399088, "time_loss": 0.000243}
[03/28 01:33:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6070", "eta": "0:01:30", "loss": 0.133774, "lr": 0.010033, "mode": "train", "time_backward": 1.065144, "time_data": 0.016709, "time_diff": 1.487195, "time_forward": 0.398356, "time_loss": 0.000253}
[03/28 01:34:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6080", "eta": "0:01:08", "loss": 0.118409, "lr": 0.010049, "mode": "train", "time_backward": 1.072267, "time_data": 0.017568, "time_diff": 1.491615, "time_forward": 0.398523, "time_loss": 0.000266}
[03/28 01:35:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6090", "eta": "0:00:46", "loss": 0.124042, "lr": 0.010066, "mode": "train", "time_backward": 1.056414, "time_data": 0.017062, "time_diff": 1.475756, "time_forward": 0.397589, "time_loss": 0.000211}
[03/28 01:36:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6100", "eta": "0:00:24", "loss": 0.117046, "lr": 0.010082, "mode": "train", "time_backward": 1.054410, "time_data": 0.017252, "time_diff": 1.475769, "time_forward": 0.398810, "time_loss": 0.000397}
[03/28 01:37:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "1", "cur_iter": "6110", "eta": "0:00:02", "loss": 0.114331, "lr": 0.010098, "mode": "train", "time_backward": 1.054294, "time_data": 0.028811, "time_diff": 1.811041, "time_forward": 0.721391, "time_loss": 0.000453}
[03/28 02:18:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "10", "eta": "3:44:42", "loss": 0.123567, "lr": 0.010115, "mode": "train", "time_backward": 1.056743, "time_data": 0.016965, "time_diff": 1.477906, "time_forward": 0.398725, "time_loss": 0.000238}
[03/28 02:20:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "20", "eta": "3:44:19", "loss": 0.116037, "lr": 0.010131, "mode": "train", "time_backward": 1.057069, "time_data": 0.017386, "time_diff": 1.480900, "time_forward": 0.399579, "time_loss": 0.000346}
[03/28 02:20:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "30", "eta": "3:43:57", "loss": 0.120723, "lr": 0.010147, "mode": "train", "time_backward": 1.091284, "time_data": 0.017089, "time_diff": 1.513388, "time_forward": 0.398441, "time_loss": 0.000274}
[03/28 02:22:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "40", "eta": "3:43:35", "loss": 0.125606, "lr": 0.010164, "mode": "train", "time_backward": 1.055138, "time_data": 0.017963, "time_diff": 1.476807, "time_forward": 0.398968, "time_loss": 0.000299}
[03/28 02:23:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "50", "eta": "3:43:13", "loss": 0.115466, "lr": 0.010180, "mode": "train", "time_backward": 1.057420, "time_data": 0.016831, "time_diff": 1.485769, "time_forward": 0.399727, "time_loss": 0.000268}
[03/28 02:24:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "60", "eta": "3:42:50", "loss": 0.110278, "lr": 0.010196, "mode": "train", "time_backward": 1.065637, "time_data": 0.017073, "time_diff": 1.484546, "time_forward": 0.398404, "time_loss": 0.000248}
[03/28 02:24:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "70", "eta": "3:42:28", "loss": 0.119527, "lr": 0.010213, "mode": "train", "time_backward": 1.056230, "time_data": 0.018843, "time_diff": 1.481015, "time_forward": 0.399500, "time_loss": 0.000386}
[03/28 02:25:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "80", "eta": "3:42:06", "loss": 0.119928, "lr": 0.010229, "mode": "train", "time_backward": 1.053207, "time_data": 0.017416, "time_diff": 1.476265, "time_forward": 0.398220, "time_loss": 0.000245}
[03/28 02:26:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "90", "eta": "3:41:45", "loss": 0.116709, "lr": 0.010245, "mode": "train", "time_backward": 1.099490, "time_data": 0.017099, "time_diff": 1.519662, "time_forward": 0.399578, "time_loss": 0.000308}
[03/28 02:27:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "100", "eta": "3:41:22", "loss": 0.132341, "lr": 0.010262, "mode": "train", "time_backward": 1.055596, "time_data": 0.017317, "time_diff": 1.479829, "time_forward": 0.399004, "time_loss": 0.000249}
[03/28 02:28:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "110", "eta": "3:41:00", "loss": 0.109903, "lr": 0.010278, "mode": "train", "time_backward": 1.057961, "time_data": 0.017564, "time_diff": 1.478651, "time_forward": 0.399801, "time_loss": 0.000347}
[03/28 02:30:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "120", "eta": "3:55:43", "loss": 0.116945, "lr": 0.010294, "mode": "train", "time_backward": 1.563228, "time_data": 74.118488, "time_diff": 77.016756, "time_forward": 1.180377, "time_loss": 0.012990}
[03/28 02:30:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "130", "eta": "3:55:19", "loss": 0.128964, "lr": 0.010311, "mode": "train", "time_backward": 1.060039, "time_data": 0.018715, "time_diff": 1.482576, "time_forward": 0.400198, "time_loss": 0.000388}
[03/28 02:31:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "140", "eta": "3:54:55", "loss": 0.116023, "lr": 0.010327, "mode": "train", "time_backward": 1.056424, "time_data": 0.016855, "time_diff": 1.475676, "time_forward": 0.398568, "time_loss": 0.000272}
[03/28 02:31:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "150", "eta": "3:54:29", "loss": 0.115922, "lr": 0.010343, "mode": "train", "time_backward": 1.057003, "time_data": 0.017149, "time_diff": 1.481642, "time_forward": 0.399222, "time_loss": 0.000259}
[03/28 02:32:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "160", "eta": "3:57:18", "loss": 0.115262, "lr": 0.010360, "mode": "train", "time_backward": 1.056425, "time_data": 16.157332, "time_diff": 17.632250, "time_forward": 0.414787, "time_loss": 0.000446}
[03/28 02:33:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "170", "eta": "3:56:58", "loss": 0.127334, "lr": 0.010376, "mode": "train", "time_backward": 1.060764, "time_data": 0.016727, "time_diff": 1.945808, "time_forward": 0.860788, "time_loss": 0.000337}
[03/28 02:33:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "180", "eta": "3:56:29", "loss": 0.130045, "lr": 0.010392, "mode": "train", "time_backward": 1.053437, "time_data": 0.018289, "time_diff": 1.474462, "time_forward": 0.399017, "time_loss": 0.000193}
[03/28 02:34:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "190", "eta": "3:56:05", "loss": 0.110903, "lr": 0.010409, "mode": "train", "time_backward": 1.073014, "time_data": 0.019942, "time_diff": 1.497260, "time_forward": 0.400951, "time_loss": 0.000417}
[03/28 02:35:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "200", "eta": "3:57:54", "loss": 0.119628, "lr": 0.010425, "mode": "train", "time_backward": 1.193419, "time_data": 11.048328, "time_diff": 12.782369, "time_forward": 0.533443, "time_loss": 0.000415}
[03/28 02:35:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "210", "eta": "4:00:26", "loss": 0.116338, "lr": 0.010441, "mode": "train", "time_backward": 16.193606, "time_data": 0.017018, "time_diff": 16.620990, "time_forward": 0.399262, "time_loss": 0.000228}
[03/28 02:37:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "220", "eta": "3:59:59", "loss": 0.114802, "lr": 0.010458, "mode": "train", "time_backward": 1.056013, "time_data": 0.017180, "time_diff": 1.476213, "time_forward": 0.399264, "time_loss": 0.000266}
[03/28 02:37:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "230", "eta": "3:59:34", "loss": 0.102238, "lr": 0.010474, "mode": "train", "time_backward": 1.056909, "time_data": 0.017103, "time_diff": 1.480295, "time_forward": 0.399072, "time_loss": 0.000253}
[03/28 02:38:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "240", "eta": "3:59:09", "loss": 0.120919, "lr": 0.010490, "mode": "train", "time_backward": 1.053965, "time_data": 0.016810, "time_diff": 1.475187, "time_forward": 0.397142, "time_loss": 0.000288}
[03/28 02:39:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "250", "eta": "3:58:44", "loss": 0.116543, "lr": 0.010507, "mode": "train", "time_backward": 1.062887, "time_data": 0.017204, "time_diff": 1.510788, "time_forward": 0.400403, "time_loss": 0.000411}
[03/28 02:39:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "260", "eta": "3:58:19", "loss": 0.117683, "lr": 0.010523, "mode": "train", "time_backward": 1.055228, "time_data": 0.017112, "time_diff": 1.479165, "time_forward": 0.398764, "time_loss": 0.000339}
[03/28 02:41:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "270", "eta": "3:57:55", "loss": 0.107455, "lr": 0.010539, "mode": "train", "time_backward": 1.054609, "time_data": 0.017283, "time_diff": 1.476098, "time_forward": 0.398857, "time_loss": 0.000263}
[03/28 02:41:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "280", "eta": "3:57:31", "loss": 0.111310, "lr": 0.010555, "mode": "train", "time_backward": 1.056901, "time_data": 0.016919, "time_diff": 1.478881, "time_forward": 0.398717, "time_loss": 0.000262}
[03/28 02:41:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "290", "eta": "3:57:06", "loss": 0.100611, "lr": 0.010572, "mode": "train", "time_backward": 1.069825, "time_data": 0.017015, "time_diff": 1.494070, "time_forward": 0.402564, "time_loss": 0.001149}
[03/28 02:43:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "300", "eta": "3:56:42", "loss": 0.109336, "lr": 0.010588, "mode": "train", "time_backward": 1.054894, "time_data": 0.017733, "time_diff": 1.480000, "time_forward": 0.399266, "time_loss": 0.000265}
[03/28 02:43:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "310", "eta": "3:54:32", "loss": 0.113755, "lr": 0.010604, "mode": "train", "time_backward": 1.056254, "time_data": 0.019256, "time_diff": 1.479042, "time_forward": 0.399906, "time_loss": 0.000437}
[03/28 02:43:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "320", "eta": "3:54:08", "loss": 0.120640, "lr": 0.010621, "mode": "train", "time_backward": 1.059041, "time_data": 0.017248, "time_diff": 1.518303, "time_forward": 0.438874, "time_loss": 0.000261}
[03/28 02:45:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "330", "eta": "3:54:22", "loss": 0.117056, "lr": 0.010637, "mode": "train", "time_backward": 4.341770, "time_data": 0.016913, "time_diff": 4.811771, "time_forward": 0.406738, "time_loss": 0.000486}
[03/28 02:45:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "340", "eta": "3:53:58", "loss": 0.113763, "lr": 0.010653, "mode": "train", "time_backward": 1.069079, "time_data": 0.018619, "time_diff": 1.590446, "time_forward": 0.397974, "time_loss": 0.000239}
[03/28 02:45:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "350", "eta": "3:53:34", "loss": 0.120087, "lr": 0.010670, "mode": "train", "time_backward": 1.054117, "time_data": 0.017369, "time_diff": 1.475565, "time_forward": 0.399592, "time_loss": 0.000319}
[03/28 02:45:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "360", "eta": "3:53:10", "loss": 0.136306, "lr": 0.010686, "mode": "train", "time_backward": 1.083047, "time_data": 0.017796, "time_diff": 1.536876, "time_forward": 0.412223, "time_loss": 0.000350}
[03/28 02:46:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "370", "eta": "3:53:08", "loss": 0.118030, "lr": 0.010702, "mode": "train", "time_backward": 2.290515, "time_data": 0.248609, "time_diff": 3.530142, "time_forward": 0.985864, "time_loss": 0.000302}
[03/28 02:46:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "380", "eta": "3:52:44", "loss": 0.104768, "lr": 0.010719, "mode": "train", "time_backward": 1.061100, "time_data": 0.018755, "time_diff": 1.495123, "time_forward": 0.399897, "time_loss": 0.000362}
[03/28 02:47:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "390", "eta": "3:50:02", "loss": 0.123196, "lr": 0.010735, "mode": "train", "time_backward": 1.123632, "time_data": 0.019760, "time_diff": 1.544873, "time_forward": 0.398368, "time_loss": 0.000238}
[03/28 02:47:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "400", "eta": "3:49:39", "loss": 0.108486, "lr": 0.010751, "mode": "train", "time_backward": 1.064243, "time_data": 0.017800, "time_diff": 1.494734, "time_forward": 0.409163, "time_loss": 0.000338}
[03/28 02:48:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "410", "eta": "3:49:15", "loss": 0.125914, "lr": 0.010768, "mode": "train", "time_backward": 1.125281, "time_data": 0.017638, "time_diff": 1.548093, "time_forward": 0.398754, "time_loss": 0.000311}
[03/28 02:48:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "420", "eta": "3:48:51", "loss": 0.105292, "lr": 0.010784, "mode": "train", "time_backward": 1.057312, "time_data": 0.017517, "time_diff": 1.490624, "time_forward": 0.398147, "time_loss": 0.000237}
[03/28 02:48:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "430", "eta": "3:48:27", "loss": 0.126374, "lr": 0.010800, "mode": "train", "time_backward": 1.087285, "time_data": 0.017927, "time_diff": 1.568559, "time_forward": 0.462513, "time_loss": 0.000541}
[03/28 02:49:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "440", "eta": "3:58:46", "loss": 0.099466, "lr": 0.010817, "mode": "train", "time_backward": 57.619683, "time_data": 0.017578, "time_diff": 58.225838, "time_forward": 0.399215, "time_loss": 0.000222}
[03/28 02:50:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "450", "eta": "3:57:00", "loss": 0.121853, "lr": 0.010833, "mode": "train", "time_backward": 1.054669, "time_data": 0.017263, "time_diff": 1.480529, "time_forward": 0.397819, "time_loss": 0.000290}
[03/28 02:50:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "460", "eta": "3:56:35", "loss": 0.115581, "lr": 0.010849, "mode": "train", "time_backward": 1.055468, "time_data": 0.018231, "time_diff": 1.497560, "time_forward": 0.418264, "time_loss": 0.000353}
[03/28 02:50:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "470", "eta": "3:56:14", "loss": 0.123134, "lr": 0.010866, "mode": "train", "time_backward": 1.102523, "time_data": 0.042095, "time_diff": 1.899644, "time_forward": 0.747593, "time_loss": 0.000411}
[03/28 02:51:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "480", "eta": "3:55:49", "loss": 0.134198, "lr": 0.010882, "mode": "train", "time_backward": 1.060115, "time_data": 0.017248, "time_diff": 1.482954, "time_forward": 0.398220, "time_loss": 0.000334}
[03/28 02:51:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "490", "eta": "3:55:24", "loss": 0.118235, "lr": 0.010898, "mode": "train", "time_backward": 1.061315, "time_data": 0.017483, "time_diff": 1.485024, "time_forward": 0.402266, "time_loss": 0.000630}
[03/28 02:51:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "500", "eta": "3:54:59", "loss": 0.131281, "lr": 0.010915, "mode": "train", "time_backward": 1.085266, "time_data": 0.017799, "time_diff": 1.512853, "time_forward": 0.399839, "time_loss": 0.000377}
[03/28 02:51:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "510", "eta": "3:54:33", "loss": 0.120358, "lr": 0.010931, "mode": "train", "time_backward": 1.056450, "time_data": 0.017489, "time_diff": 1.477595, "time_forward": 0.400148, "time_loss": 0.000395}
[03/28 02:52:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "520", "eta": "3:54:08", "loss": 0.116238, "lr": 0.010947, "mode": "train", "time_backward": 1.118581, "time_data": 0.019081, "time_diff": 1.546287, "time_forward": 0.401622, "time_loss": 0.000427}
[03/28 02:52:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "530", "eta": "3:52:11", "loss": 0.125499, "lr": 0.010964, "mode": "train", "time_backward": 1.110030, "time_data": 0.018773, "time_diff": 1.539414, "time_forward": 0.397861, "time_loss": 0.000264}
[03/28 02:53:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "540", "eta": "3:51:46", "loss": 0.122895, "lr": 0.010980, "mode": "train", "time_backward": 1.058704, "time_data": 0.017374, "time_diff": 1.489274, "time_forward": 0.404458, "time_loss": 0.000301}
[03/28 02:53:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "550", "eta": "3:51:35", "loss": 0.113407, "lr": 0.010996, "mode": "train", "time_backward": 2.277616, "time_data": 0.033779, "time_diff": 2.789905, "time_forward": 0.409422, "time_loss": 0.000316}
[03/28 02:53:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "560", "eta": "3:51:10", "loss": 0.121854, "lr": 0.011013, "mode": "train", "time_backward": 1.061427, "time_data": 0.029979, "time_diff": 1.507487, "time_forward": 0.408865, "time_loss": 0.000338}
[03/28 02:53:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "570", "eta": "3:50:46", "loss": 0.120926, "lr": 0.011029, "mode": "train", "time_backward": 1.067446, "time_data": 0.020693, "time_diff": 1.601161, "time_forward": 0.509261, "time_loss": 0.000398}
[03/28 02:54:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "580", "eta": "3:50:24", "loss": 0.105399, "lr": 0.011045, "mode": "train", "time_backward": 1.104244, "time_data": 0.019707, "time_diff": 1.844134, "time_forward": 0.712654, "time_loss": 0.000470}
[03/28 02:54:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "590", "eta": "3:49:59", "loss": 0.121810, "lr": 0.011062, "mode": "train", "time_backward": 1.097230, "time_data": 0.020684, "time_diff": 1.546859, "time_forward": 0.427956, "time_loss": 0.000547}
[03/28 02:54:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "600", "eta": "3:49:33", "loss": 0.111016, "lr": 0.011078, "mode": "train", "time_backward": 1.054226, "time_data": 0.019969, "time_diff": 1.478543, "time_forward": 0.400505, "time_loss": 0.000279}
[03/28 02:55:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "610", "eta": "3:49:09", "loss": 0.117164, "lr": 0.011094, "mode": "train", "time_backward": 1.143569, "time_data": 0.019172, "time_diff": 1.614553, "time_forward": 0.413958, "time_loss": 0.000277}
[03/28 02:56:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "620", "eta": "3:48:46", "loss": 0.120362, "lr": 0.011111, "mode": "train", "time_backward": 1.075767, "time_data": 0.026315, "time_diff": 1.622874, "time_forward": 0.442193, "time_loss": 0.004910}
[03/28 02:56:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "630", "eta": "3:48:21", "loss": 0.129777, "lr": 0.011127, "mode": "train", "time_backward": 1.053309, "time_data": 0.021644, "time_diff": 1.508503, "time_forward": 0.398867, "time_loss": 0.029941}
[03/28 02:56:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "640", "eta": "3:47:56", "loss": 0.110361, "lr": 0.011143, "mode": "train", "time_backward": 1.065020, "time_data": 0.018441, "time_diff": 1.535978, "time_forward": 0.399694, "time_loss": 0.000361}
[03/28 02:56:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "650", "eta": "3:47:31", "loss": 0.118847, "lr": 0.011160, "mode": "train", "time_backward": 1.077410, "time_data": 0.017805, "time_diff": 1.498684, "time_forward": 0.399983, "time_loss": 0.000352}
[03/28 02:57:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "660", "eta": "3:47:08", "loss": 0.101707, "lr": 0.011176, "mode": "train", "time_backward": 1.095307, "time_data": 0.075248, "time_diff": 1.664306, "time_forward": 0.465360, "time_loss": 0.000349}
[03/28 02:57:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "670", "eta": "3:46:42", "loss": 0.101894, "lr": 0.011192, "mode": "train", "time_backward": 1.055841, "time_data": 0.016968, "time_diff": 1.475879, "time_forward": 0.398907, "time_loss": 0.000329}
[03/28 02:58:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "680", "eta": "3:46:08", "loss": 0.112394, "lr": 0.011209, "mode": "train", "time_backward": 1.055139, "time_data": 0.017117, "time_diff": 1.488204, "time_forward": 0.399434, "time_loss": 0.000293}
[03/28 02:58:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "690", "eta": "3:45:43", "loss": 0.119813, "lr": 0.011225, "mode": "train", "time_backward": 1.082972, "time_data": 0.018796, "time_diff": 1.557593, "time_forward": 0.425273, "time_loss": 0.000487}
[03/28 02:58:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "700", "eta": "3:45:19", "loss": 0.122713, "lr": 0.011241, "mode": "train", "time_backward": 1.061598, "time_data": 0.016835, "time_diff": 1.564439, "time_forward": 0.482311, "time_loss": 0.000327}
[03/28 02:59:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "710", "eta": "3:44:54", "loss": 0.116802, "lr": 0.011257, "mode": "train", "time_backward": 1.067718, "time_data": 0.022460, "time_diff": 1.523118, "time_forward": 0.423385, "time_loss": 0.000296}
[03/28 02:59:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "720", "eta": "3:45:30", "loss": 0.119863, "lr": 0.011274, "mode": "train", "time_backward": 5.936661, "time_data": 0.647997, "time_diff": 7.081486, "time_forward": 0.439153, "time_loss": 0.000255}
[03/28 02:59:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "730", "eta": "3:45:05", "loss": 0.121900, "lr": 0.011290, "mode": "train", "time_backward": 1.054426, "time_data": 0.057698, "time_diff": 1.531675, "time_forward": 0.400898, "time_loss": 0.000274}
[03/28 03:00:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "740", "eta": "3:44:40", "loss": 0.120631, "lr": 0.011306, "mode": "train", "time_backward": 1.061046, "time_data": 0.017686, "time_diff": 1.486123, "time_forward": 0.397941, "time_loss": 0.000224}
[03/28 03:00:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "750", "eta": "3:44:18", "loss": 0.120247, "lr": 0.011323, "mode": "train", "time_backward": 1.204401, "time_data": 0.016702, "time_diff": 1.805888, "time_forward": 0.581211, "time_loss": 0.000250}
[03/28 03:00:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "760", "eta": "3:43:53", "loss": 0.116246, "lr": 0.011339, "mode": "train", "time_backward": 1.063116, "time_data": 0.017605, "time_diff": 1.483727, "time_forward": 0.400120, "time_loss": 0.000390}
[03/28 03:02:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "770", "eta": "3:39:52", "loss": 0.126908, "lr": 0.011355, "mode": "train", "time_backward": 1.057720, "time_data": 0.017981, "time_diff": 1.522872, "time_forward": 0.398810, "time_loss": 0.000252}
[03/28 03:02:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "780", "eta": "3:40:27", "loss": 0.126359, "lr": 0.011372, "mode": "train", "time_backward": 6.640251, "time_data": 0.017610, "time_diff": 7.066037, "time_forward": 0.399875, "time_loss": 0.000352}
[03/28 03:03:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "790", "eta": "3:42:53", "loss": 0.112809, "lr": 0.011388, "mode": "train", "time_backward": 17.060418, "time_data": 0.017002, "time_diff": 17.576560, "time_forward": 0.398771, "time_loss": 0.000254}
[03/28 03:03:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "800", "eta": "3:43:38", "loss": 0.112922, "lr": 0.011404, "mode": "train", "time_backward": 7.720478, "time_data": 0.017033, "time_diff": 8.139397, "time_forward": 0.398629, "time_loss": 0.000290}
[03/28 03:04:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "810", "eta": "3:43:13", "loss": 0.144267, "lr": 0.011421, "mode": "train", "time_backward": 1.061587, "time_data": 0.017404, "time_diff": 1.481575, "time_forward": 0.399789, "time_loss": 0.000430}
[03/28 03:04:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "820", "eta": "3:42:48", "loss": 0.107384, "lr": 0.011437, "mode": "train", "time_backward": 1.056827, "time_data": 0.019096, "time_diff": 1.478782, "time_forward": 0.399233, "time_loss": 0.000404}
[03/28 03:05:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "830", "eta": "3:42:23", "loss": 0.115282, "lr": 0.011453, "mode": "train", "time_backward": 1.062474, "time_data": 0.017494, "time_diff": 1.518580, "time_forward": 0.399809, "time_loss": 0.000404}
[03/28 03:06:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "840", "eta": "3:41:58", "loss": 0.118832, "lr": 0.011470, "mode": "train", "time_backward": 1.073219, "time_data": 0.017853, "time_diff": 1.506481, "time_forward": 0.399886, "time_loss": 0.000360}
[03/28 03:06:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "850", "eta": "3:38:45", "loss": 0.122168, "lr": 0.011486, "mode": "train", "time_backward": 1.066206, "time_data": 0.016822, "time_diff": 1.490103, "time_forward": 0.399230, "time_loss": 0.000229}
[03/28 03:06:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "860", "eta": "3:38:26", "loss": 0.110833, "lr": 0.011502, "mode": "train", "time_backward": 1.262496, "time_data": 0.016965, "time_diff": 2.000487, "time_forward": 0.709997, "time_loss": 0.000419}
[03/28 03:07:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "870", "eta": "3:39:23", "loss": 0.112911, "lr": 0.011519, "mode": "train", "time_backward": 8.911412, "time_data": 0.017385, "time_diff": 9.338643, "time_forward": 0.399345, "time_loss": 0.000369}
[03/28 03:07:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "880", "eta": "3:38:58", "loss": 0.112879, "lr": 0.011535, "mode": "train", "time_backward": 1.053603, "time_data": 0.016651, "time_diff": 1.471808, "time_forward": 0.396821, "time_loss": 0.000194}
[03/28 03:08:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "890", "eta": "3:38:47", "loss": 0.107146, "lr": 0.011551, "mode": "train", "time_backward": 1.203229, "time_data": 0.231529, "time_diff": 2.865221, "time_forward": 1.320417, "time_loss": 0.035604}
[03/28 03:08:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "900", "eta": "3:38:20", "loss": 0.143427, "lr": 0.011568, "mode": "train", "time_backward": 1.065613, "time_data": 0.017233, "time_diff": 1.489729, "time_forward": 0.399897, "time_loss": 0.000378}
[03/28 03:09:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "910", "eta": "3:37:56", "loss": 0.128153, "lr": 0.011584, "mode": "train", "time_backward": 1.079396, "time_data": 0.070215, "time_diff": 1.670734, "time_forward": 0.514511, "time_loss": 0.000489}
[03/28 03:10:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "920", "eta": "3:38:52", "loss": 0.111032, "lr": 0.011600, "mode": "train", "time_backward": 8.901450, "time_data": 0.017143, "time_diff": 9.325295, "time_forward": 0.401825, "time_loss": 0.000352}
[03/28 03:10:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "930", "eta": "3:38:27", "loss": 0.127673, "lr": 0.011617, "mode": "train", "time_backward": 1.066918, "time_data": 0.018200, "time_diff": 1.486686, "time_forward": 0.398218, "time_loss": 0.000231}
[03/28 03:11:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "940", "eta": "3:38:05", "loss": 0.112573, "lr": 0.011633, "mode": "train", "time_backward": 1.101134, "time_data": 0.103179, "time_diff": 1.843554, "time_forward": 0.559936, "time_loss": 0.000275}
[03/28 03:12:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "950", "eta": "3:39:34", "loss": 0.115584, "lr": 0.011649, "mode": "train", "time_backward": 12.136576, "time_data": 0.017258, "time_diff": 12.561253, "time_forward": 0.399383, "time_loss": 0.000291}
[03/28 03:12:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "960", "eta": "3:39:09", "loss": 0.128069, "lr": 0.011666, "mode": "train", "time_backward": 1.055144, "time_data": 0.017153, "time_diff": 1.486742, "time_forward": 0.406788, "time_loss": 0.000274}
[03/28 03:13:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "970", "eta": "3:38:42", "loss": 0.120187, "lr": 0.011682, "mode": "train", "time_backward": 1.059906, "time_data": 0.018918, "time_diff": 1.490084, "time_forward": 0.399198, "time_loss": 0.000259}
[03/28 03:13:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "980", "eta": "3:38:15", "loss": 0.109744, "lr": 0.011698, "mode": "train", "time_backward": 1.056410, "time_data": 0.017162, "time_diff": 1.481179, "time_forward": 0.398949, "time_loss": 0.000422}
[03/28 03:15:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "990", "eta": "3:39:45", "loss": 0.112441, "lr": 0.011715, "mode": "train", "time_backward": 12.454815, "time_data": 0.016964, "time_diff": 12.877312, "time_forward": 0.399810, "time_loss": 0.000291}
[03/28 03:15:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1000", "eta": "3:39:19", "loss": 0.110973, "lr": 0.011731, "mode": "train", "time_backward": 1.054787, "time_data": 0.016909, "time_diff": 1.472272, "time_forward": 0.397099, "time_loss": 0.000214}
[03/28 03:16:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1010", "eta": "3:38:53", "loss": 0.111221, "lr": 0.011747, "mode": "train", "time_backward": 1.052957, "time_data": 0.017339, "time_diff": 1.478419, "time_forward": 0.398571, "time_loss": 0.000204}
[03/28 03:17:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1020", "eta": "3:38:28", "loss": 0.120304, "lr": 0.011764, "mode": "train", "time_backward": 1.088152, "time_data": 0.016903, "time_diff": 1.578793, "time_forward": 0.464996, "time_loss": 0.000440}
[03/28 03:17:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1030", "eta": "3:38:01", "loss": 0.124737, "lr": 0.011780, "mode": "train", "time_backward": 1.056198, "time_data": 0.017931, "time_diff": 1.477565, "time_forward": 0.398719, "time_loss": 0.000238}
[03/28 03:18:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1040", "eta": "3:37:36", "loss": 0.114309, "lr": 0.011796, "mode": "train", "time_backward": 1.101722, "time_data": 0.018013, "time_diff": 1.549830, "time_forward": 0.401061, "time_loss": 0.000442}
[03/28 03:18:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1050", "eta": "3:38:11", "loss": 0.117932, "lr": 0.011813, "mode": "train", "time_backward": 7.059969, "time_data": 0.023158, "time_diff": 7.500394, "time_forward": 0.411340, "time_loss": 0.000329}
[03/28 03:19:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1060", "eta": "3:37:46", "loss": 0.109971, "lr": 0.011829, "mode": "train", "time_backward": 1.156579, "time_data": 0.017294, "time_diff": 1.606360, "time_forward": 0.398546, "time_loss": 0.000272}
[03/28 03:19:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1070", "eta": "3:37:00", "loss": 0.124306, "lr": 0.011845, "mode": "train", "time_backward": 1.095594, "time_data": 0.017157, "time_diff": 1.517923, "time_forward": 0.398890, "time_loss": 0.000460}
[03/28 03:19:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1080", "eta": "3:36:34", "loss": 0.112951, "lr": 0.011862, "mode": "train", "time_backward": 1.058622, "time_data": 0.017184, "time_diff": 1.479460, "time_forward": 0.398769, "time_loss": 0.000345}
[03/28 03:21:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1090", "eta": "3:36:08", "loss": 0.123724, "lr": 0.011878, "mode": "train", "time_backward": 1.099034, "time_data": 0.038681, "time_diff": 1.580893, "time_forward": 0.432499, "time_loss": 0.000376}
[03/28 03:21:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1100", "eta": "3:35:42", "loss": 0.126098, "lr": 0.011894, "mode": "train", "time_backward": 1.060474, "time_data": 0.018920, "time_diff": 1.523032, "time_forward": 0.407771, "time_loss": 0.000283}
[03/28 03:21:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1110", "eta": "3:35:16", "loss": 0.107919, "lr": 0.011911, "mode": "train", "time_backward": 1.053955, "time_data": 0.017381, "time_diff": 1.496722, "time_forward": 0.400222, "time_loss": 0.000318}
[03/28 03:22:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1120", "eta": "3:34:50", "loss": 0.107022, "lr": 0.011927, "mode": "train", "time_backward": 1.077379, "time_data": 0.016884, "time_diff": 1.498232, "time_forward": 0.400321, "time_loss": 0.000282}
[03/28 03:23:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1130", "eta": "3:45:01", "loss": 0.106572, "lr": 0.011943, "mode": "train", "time_backward": 64.988367, "time_data": 0.017516, "time_diff": 65.414035, "time_forward": 0.400647, "time_loss": 0.000308}
[03/28 03:23:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1140", "eta": "3:44:34", "loss": 0.122979, "lr": 0.011959, "mode": "train", "time_backward": 1.085742, "time_data": 0.016814, "time_diff": 1.528404, "time_forward": 0.403629, "time_loss": 0.000290}
[03/28 03:24:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1150", "eta": "3:44:07", "loss": 0.100284, "lr": 0.011976, "mode": "train", "time_backward": 1.056538, "time_data": 0.018422, "time_diff": 1.477909, "time_forward": 0.399511, "time_loss": 0.000314}
[03/28 03:24:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1160", "eta": "3:43:40", "loss": 0.112329, "lr": 0.011992, "mode": "train", "time_backward": 1.053823, "time_data": 0.016971, "time_diff": 1.476189, "time_forward": 0.400883, "time_loss": 0.000290}
[03/28 03:25:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1170", "eta": "3:43:12", "loss": 0.117238, "lr": 0.012008, "mode": "train", "time_backward": 1.132150, "time_data": 0.017124, "time_diff": 1.563183, "time_forward": 0.399895, "time_loss": 0.000398}
[03/28 03:25:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1180", "eta": "3:42:47", "loss": 0.103771, "lr": 0.012025, "mode": "train", "time_backward": 1.056423, "time_data": 0.017553, "time_diff": 1.665620, "time_forward": 0.398681, "time_loss": 0.000280}
[03/28 03:26:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1190", "eta": "3:42:19", "loss": 0.125221, "lr": 0.012041, "mode": "train", "time_backward": 1.117007, "time_data": 0.017977, "time_diff": 1.546459, "time_forward": 0.399384, "time_loss": 0.000343}
[03/28 03:26:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1200", "eta": "3:41:51", "loss": 0.102778, "lr": 0.012057, "mode": "train", "time_backward": 1.065546, "time_data": 0.020653, "time_diff": 1.506133, "time_forward": 0.411593, "time_loss": 0.000341}
[03/28 03:26:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1210", "eta": "3:41:23", "loss": 0.113552, "lr": 0.012074, "mode": "train", "time_backward": 1.055282, "time_data": 0.023077, "time_diff": 1.537824, "time_forward": 0.440741, "time_loss": 0.000365}
[03/28 03:27:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1220", "eta": "3:40:56", "loss": 0.116830, "lr": 0.012090, "mode": "train", "time_backward": 1.117287, "time_data": 0.038604, "time_diff": 1.574898, "time_forward": 0.401497, "time_loss": 0.000260}
[03/28 03:27:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1230", "eta": "3:40:29", "loss": 0.108488, "lr": 0.012106, "mode": "train", "time_backward": 1.064843, "time_data": 0.018957, "time_diff": 1.504989, "time_forward": 0.414074, "time_loss": 0.000352}
[03/28 03:27:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1240", "eta": "3:40:02", "loss": 0.104863, "lr": 0.012123, "mode": "train", "time_backward": 1.060163, "time_data": 0.017329, "time_diff": 1.486294, "time_forward": 0.399102, "time_loss": 0.000271}
[03/28 03:28:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1250", "eta": "3:39:37", "loss": 0.109492, "lr": 0.012139, "mode": "train", "time_backward": 1.088592, "time_data": 0.139625, "time_diff": 1.782287, "time_forward": 0.545415, "time_loss": 0.000768}
[03/28 03:28:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1260", "eta": "3:39:11", "loss": 0.110122, "lr": 0.012155, "mode": "train", "time_backward": 1.079527, "time_data": 0.018452, "time_diff": 1.515748, "time_forward": 0.399334, "time_loss": 0.000231}
[03/28 03:28:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1270", "eta": "3:38:44", "loss": 0.099165, "lr": 0.012172, "mode": "train", "time_backward": 1.076501, "time_data": 0.018098, "time_diff": 1.504233, "time_forward": 0.398411, "time_loss": 0.000318}
[03/28 03:29:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1280", "eta": "3:38:17", "loss": 0.119235, "lr": 0.012188, "mode": "train", "time_backward": 1.071998, "time_data": 0.016861, "time_diff": 1.492260, "time_forward": 0.399712, "time_loss": 0.000316}
[03/28 03:29:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1290", "eta": "3:38:11", "loss": 0.127187, "lr": 0.012204, "mode": "train", "time_backward": 1.251568, "time_data": 0.038040, "time_diff": 3.723948, "time_forward": 2.357381, "time_loss": 0.002247}
[03/28 03:29:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1300", "eta": "3:37:43", "loss": 0.108046, "lr": 0.012221, "mode": "train", "time_backward": 1.057028, "time_data": 0.021207, "time_diff": 1.499710, "time_forward": 0.413021, "time_loss": 0.000341}
[03/28 03:29:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1310", "eta": "3:37:15", "loss": 0.115120, "lr": 0.012237, "mode": "train", "time_backward": 1.054008, "time_data": 0.017626, "time_diff": 1.477236, "time_forward": 0.401704, "time_loss": 0.000326}
[03/28 03:30:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1320", "eta": "3:36:48", "loss": 0.109341, "lr": 0.012253, "mode": "train", "time_backward": 1.067476, "time_data": 0.017221, "time_diff": 1.491138, "time_forward": 0.402185, "time_loss": 0.000429}
[03/28 03:31:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1330", "eta": "3:36:21", "loss": 0.104572, "lr": 0.012270, "mode": "train", "time_backward": 1.093738, "time_data": 0.017529, "time_diff": 1.581462, "time_forward": 0.467074, "time_loss": 0.000472}
[03/28 03:31:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1340", "eta": "3:35:52", "loss": 0.133748, "lr": 0.012286, "mode": "train", "time_backward": 1.074520, "time_data": 0.017642, "time_diff": 1.499688, "time_forward": 0.399601, "time_loss": 0.000389}
[03/28 03:31:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1350", "eta": "3:35:26", "loss": 0.113821, "lr": 0.012302, "mode": "train", "time_backward": 1.211817, "time_data": 0.016967, "time_diff": 1.680022, "time_forward": 0.447510, "time_loss": 0.000385}
[03/28 03:32:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1360", "eta": "3:35:00", "loss": 0.120914, "lr": 0.012319, "mode": "train", "time_backward": 1.099873, "time_data": 0.017224, "time_diff": 1.583015, "time_forward": 0.401081, "time_loss": 0.000253}
[03/28 03:32:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1370", "eta": "3:34:32", "loss": 0.108260, "lr": 0.012335, "mode": "train", "time_backward": 1.068347, "time_data": 0.018450, "time_diff": 1.500858, "time_forward": 0.401295, "time_loss": 0.000324}
[03/28 03:32:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1380", "eta": "3:34:03", "loss": 0.122038, "lr": 0.012351, "mode": "train", "time_backward": 1.075100, "time_data": 0.018601, "time_diff": 1.498779, "time_forward": 0.398697, "time_loss": 0.000240}
[03/28 03:33:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1390", "eta": "3:33:36", "loss": 0.111131, "lr": 0.012368, "mode": "train", "time_backward": 1.054166, "time_data": 0.017439, "time_diff": 1.475239, "time_forward": 0.399844, "time_loss": 0.000326}
[03/28 03:34:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1400", "eta": "3:33:09", "loss": 0.105235, "lr": 0.012384, "mode": "train", "time_backward": 1.055939, "time_data": 0.017115, "time_diff": 1.476863, "time_forward": 0.399189, "time_loss": 0.000299}
[03/28 03:34:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1410", "eta": "3:32:42", "loss": 0.104600, "lr": 0.012400, "mode": "train", "time_backward": 1.114244, "time_data": 0.017085, "time_diff": 1.595850, "time_forward": 0.399024, "time_loss": 0.000262}
[03/28 03:35:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1420", "eta": "3:32:16", "loss": 0.134047, "lr": 0.012417, "mode": "train", "time_backward": 1.095470, "time_data": 0.017778, "time_diff": 1.540940, "time_forward": 0.401340, "time_loss": 0.000386}
[03/28 03:35:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1430", "eta": "3:31:49", "loss": 0.107310, "lr": 0.012433, "mode": "train", "time_backward": 1.111392, "time_data": 0.017163, "time_diff": 1.582005, "time_forward": 0.398875, "time_loss": 0.000335}
[03/28 03:35:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1440", "eta": "3:31:18", "loss": 0.115092, "lr": 0.012449, "mode": "train", "time_backward": 1.069943, "time_data": 0.017332, "time_diff": 1.495792, "time_forward": 0.399498, "time_loss": 0.000550}
[03/28 03:35:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1450", "eta": "3:30:54", "loss": 0.113040, "lr": 0.012466, "mode": "train", "time_backward": 1.124094, "time_data": 0.016842, "time_diff": 1.791741, "time_forward": 0.620699, "time_loss": 0.000419}
[03/28 03:36:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1460", "eta": "3:30:26", "loss": 0.111356, "lr": 0.012482, "mode": "train", "time_backward": 1.057659, "time_data": 0.019290, "time_diff": 1.487832, "time_forward": 0.402852, "time_loss": 0.000309}
[03/28 03:36:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1470", "eta": "3:30:00", "loss": 0.112468, "lr": 0.012498, "mode": "train", "time_backward": 1.283132, "time_data": 0.017704, "time_diff": 1.726077, "time_forward": 0.399532, "time_loss": 0.000846}
[03/28 03:36:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1480", "eta": "3:29:33", "loss": 0.116314, "lr": 0.012515, "mode": "train", "time_backward": 1.056336, "time_data": 0.018038, "time_diff": 1.480283, "time_forward": 0.398498, "time_loss": 0.000301}
[03/28 03:37:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1490", "eta": "3:29:08", "loss": 0.116767, "lr": 0.012531, "mode": "train", "time_backward": 1.319740, "time_data": 0.016632, "time_diff": 1.742727, "time_forward": 0.398203, "time_loss": 0.000383}
[03/28 03:37:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1500", "eta": "3:28:41", "loss": 0.112006, "lr": 0.012547, "mode": "train", "time_backward": 1.130839, "time_data": 0.017337, "time_diff": 1.574901, "time_forward": 0.398691, "time_loss": 0.000246}
[03/28 03:38:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1510", "eta": "3:28:12", "loss": 0.104387, "lr": 0.012564, "mode": "train", "time_backward": 1.065411, "time_data": 0.016863, "time_diff": 1.487165, "time_forward": 0.398643, "time_loss": 0.000415}
[03/28 03:38:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1520", "eta": "3:27:46", "loss": 0.122620, "lr": 0.012580, "mode": "train", "time_backward": 1.092994, "time_data": 0.019902, "time_diff": 1.542806, "time_forward": 0.404822, "time_loss": 0.000238}
[03/28 03:38:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1530", "eta": "3:27:28", "loss": 0.125544, "lr": 0.012596, "mode": "train", "time_backward": 1.977599, "time_data": 0.017616, "time_diff": 2.448361, "time_forward": 0.407400, "time_loss": 0.000229}
[03/28 03:38:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1540", "eta": "3:27:00", "loss": 0.118225, "lr": 0.012613, "mode": "train", "time_backward": 1.053722, "time_data": 0.027114, "time_diff": 1.486645, "time_forward": 0.398685, "time_loss": 0.000332}
[03/28 03:39:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1550", "eta": "3:27:09", "loss": 0.101779, "lr": 0.012629, "mode": "train", "time_backward": 4.995159, "time_data": 0.017204, "time_diff": 5.417233, "time_forward": 0.399970, "time_loss": 0.000356}
[03/28 03:40:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1560", "eta": "3:26:26", "loss": 0.106087, "lr": 0.012645, "mode": "train", "time_backward": 1.056565, "time_data": 0.018295, "time_diff": 1.477809, "time_forward": 0.398861, "time_loss": 0.000238}
[03/28 03:41:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1570", "eta": "3:26:05", "loss": 0.124447, "lr": 0.012662, "mode": "train", "time_backward": 1.810757, "time_data": 0.017425, "time_diff": 2.232334, "time_forward": 0.397820, "time_loss": 0.000245}
[03/28 03:41:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1580", "eta": "3:25:37", "loss": 0.120744, "lr": 0.012678, "mode": "train", "time_backward": 1.055893, "time_data": 0.032058, "time_diff": 1.550081, "time_forward": 0.458266, "time_loss": 0.000513}
[03/28 03:41:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1590", "eta": "3:25:10", "loss": 0.112576, "lr": 0.012694, "mode": "train", "time_backward": 1.124918, "time_data": 0.022357, "time_diff": 1.552687, "time_forward": 0.398159, "time_loss": 0.000270}
[03/28 03:42:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1600", "eta": "3:24:42", "loss": 0.100564, "lr": 0.012710, "mode": "train", "time_backward": 1.053956, "time_data": 0.016675, "time_diff": 1.491724, "time_forward": 0.397432, "time_loss": 0.000257}
[03/28 03:42:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1610", "eta": "3:24:15", "loss": 0.107451, "lr": 0.012727, "mode": "train", "time_backward": 1.055064, "time_data": 0.017960, "time_diff": 1.480000, "time_forward": 0.400176, "time_loss": 0.000799}
[03/28 03:42:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1620", "eta": "3:23:49", "loss": 0.107262, "lr": 0.012743, "mode": "train", "time_backward": 1.141714, "time_data": 0.017730, "time_diff": 1.622428, "time_forward": 0.424624, "time_loss": 0.000259}
[03/28 03:43:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1630", "eta": "3:23:22", "loss": 0.105416, "lr": 0.012759, "mode": "train", "time_backward": 1.104713, "time_data": 0.017257, "time_diff": 1.582941, "time_forward": 0.399516, "time_loss": 0.000675}
[03/28 03:43:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1640", "eta": "3:22:20", "loss": 0.110778, "lr": 0.012776, "mode": "train", "time_backward": 1.055777, "time_data": 0.018140, "time_diff": 1.477016, "time_forward": 0.399160, "time_loss": 0.000306}
[03/28 03:43:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1650", "eta": "3:21:53", "loss": 0.109512, "lr": 0.012792, "mode": "train", "time_backward": 1.233126, "time_data": 0.016880, "time_diff": 1.652933, "time_forward": 0.402268, "time_loss": 0.000367}
[03/28 03:44:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1660", "eta": "3:21:27", "loss": 0.108337, "lr": 0.012808, "mode": "train", "time_backward": 1.138304, "time_data": 0.018257, "time_diff": 1.598398, "time_forward": 0.430673, "time_loss": 0.000419}
[03/28 03:44:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1670", "eta": "3:21:00", "loss": 0.115134, "lr": 0.012825, "mode": "train", "time_backward": 1.063733, "time_data": 0.017439, "time_diff": 1.486234, "time_forward": 0.397625, "time_loss": 0.000249}
[03/28 03:44:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1680", "eta": "3:20:30", "loss": 0.108237, "lr": 0.012841, "mode": "train", "time_backward": 1.079636, "time_data": 0.030185, "time_diff": 1.514393, "time_forward": 0.401204, "time_loss": 0.000242}
[03/28 03:44:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1690", "eta": "3:20:03", "loss": 0.113221, "lr": 0.012857, "mode": "train", "time_backward": 1.063802, "time_data": 0.017449, "time_diff": 1.487709, "time_forward": 0.400027, "time_loss": 0.000378}
[03/28 03:45:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1700", "eta": "3:19:39", "loss": 0.118198, "lr": 0.012874, "mode": "train", "time_backward": 1.428279, "time_data": 0.027247, "time_diff": 1.975151, "time_forward": 0.512210, "time_loss": 0.000294}
[03/28 03:45:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1710", "eta": "3:19:12", "loss": 0.110776, "lr": 0.012890, "mode": "train", "time_backward": 1.063280, "time_data": 0.032367, "time_diff": 1.532776, "time_forward": 0.399192, "time_loss": 0.035529}
[03/28 03:47:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1720", "eta": "3:18:44", "loss": 0.120805, "lr": 0.012906, "mode": "train", "time_backward": 1.056969, "time_data": 0.017657, "time_diff": 1.480898, "time_forward": 0.399344, "time_loss": 0.000278}
[03/28 03:47:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1730", "eta": "3:18:18", "loss": 0.102141, "lr": 0.012923, "mode": "train", "time_backward": 1.140506, "time_data": 0.020320, "time_diff": 1.571600, "time_forward": 0.400464, "time_loss": 0.000757}
[03/28 03:48:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1740", "eta": "3:17:51", "loss": 0.123130, "lr": 0.012939, "mode": "train", "time_backward": 1.083439, "time_data": 0.017529, "time_diff": 1.555677, "time_forward": 0.451440, "time_loss": 0.000238}
[03/28 03:48:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1750", "eta": "3:17:23", "loss": 0.125568, "lr": 0.012955, "mode": "train", "time_backward": 1.064594, "time_data": 0.018295, "time_diff": 1.485690, "time_forward": 0.399557, "time_loss": 0.000718}
[03/28 03:48:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1760", "eta": "3:16:56", "loss": 0.105675, "lr": 0.012972, "mode": "train", "time_backward": 1.056741, "time_data": 0.019560, "time_diff": 1.480277, "time_forward": 0.400296, "time_loss": 0.000434}
[03/28 03:49:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1770", "eta": "3:16:28", "loss": 0.102365, "lr": 0.012988, "mode": "train", "time_backward": 1.058404, "time_data": 0.021719, "time_diff": 1.497060, "time_forward": 0.397474, "time_loss": 0.000222}
[03/28 03:49:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1780", "eta": "3:16:16", "loss": 0.100416, "lr": 0.013004, "mode": "train", "time_backward": 3.869333, "time_data": 0.017017, "time_diff": 4.288129, "time_forward": 0.400511, "time_loss": 0.000234}
[03/28 03:50:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1790", "eta": "3:15:49", "loss": 0.117623, "lr": 0.013021, "mode": "train", "time_backward": 1.055563, "time_data": 0.018686, "time_diff": 1.481952, "time_forward": 0.403723, "time_loss": 0.000482}
[03/28 03:50:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1800", "eta": "3:15:23", "loss": 0.107084, "lr": 0.013037, "mode": "train", "time_backward": 1.055012, "time_data": 0.016608, "time_diff": 1.625921, "time_forward": 0.546951, "time_loss": 0.000306}
[03/28 03:50:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1810", "eta": "3:14:55", "loss": 0.105667, "lr": 0.013053, "mode": "train", "time_backward": 1.059517, "time_data": 0.017846, "time_diff": 1.480900, "time_forward": 0.399930, "time_loss": 0.000422}
[03/28 03:51:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1820", "eta": "3:14:26", "loss": 0.109574, "lr": 0.013070, "mode": "train", "time_backward": 1.090014, "time_data": 0.017033, "time_diff": 1.508316, "time_forward": 0.397975, "time_loss": 0.000237}
[03/28 03:52:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1830", "eta": "3:14:01", "loss": 0.101573, "lr": 0.013086, "mode": "train", "time_backward": 1.161799, "time_data": 0.021786, "time_diff": 1.815106, "time_forward": 0.624843, "time_loss": 0.000403}
[03/28 03:52:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1840", "eta": "3:13:34", "loss": 0.105093, "lr": 0.013102, "mode": "train", "time_backward": 1.061696, "time_data": 0.016845, "time_diff": 1.523160, "time_forward": 0.398923, "time_loss": 0.000384}
[03/28 03:53:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1850", "eta": "3:13:06", "loss": 0.104147, "lr": 0.013119, "mode": "train", "time_backward": 1.064207, "time_data": 0.020217, "time_diff": 1.495474, "time_forward": 0.408220, "time_loss": 0.000339}
[03/28 03:53:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1860", "eta": "3:14:11", "loss": 0.114051, "lr": 0.013135, "mode": "train", "time_backward": 12.084241, "time_data": 0.017266, "time_diff": 12.542674, "time_forward": 0.399037, "time_loss": 0.000351}
[03/28 03:54:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1870", "eta": "3:13:43", "loss": 0.110462, "lr": 0.013151, "mode": "train", "time_backward": 1.070526, "time_data": 0.018571, "time_diff": 1.500648, "time_forward": 0.406262, "time_loss": 0.000356}
[03/28 03:54:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1880", "eta": "3:13:15", "loss": 0.128642, "lr": 0.013168, "mode": "train", "time_backward": 1.140132, "time_data": 0.019812, "time_diff": 1.578102, "time_forward": 0.400948, "time_loss": 0.012236}
[03/28 03:55:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1890", "eta": "3:12:48", "loss": 0.108504, "lr": 0.013184, "mode": "train", "time_backward": 1.190360, "time_data": 0.016741, "time_diff": 1.613101, "time_forward": 0.398346, "time_loss": 0.000241}
[03/28 03:55:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1900", "eta": "3:12:45", "loss": 0.100789, "lr": 0.013200, "mode": "train", "time_backward": 4.060241, "time_data": 0.017757, "time_diff": 4.487231, "time_forward": 0.403574, "time_loss": 0.000306}
[03/28 03:57:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1910", "eta": "3:17:02", "loss": 0.124298, "lr": 0.013217, "mode": "train", "time_backward": 1.059366, "time_data": 33.833629, "time_diff": 35.314419, "time_forward": 0.417716, "time_loss": 0.000472}
[03/28 03:58:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1920", "eta": "3:16:33", "loss": 0.100889, "lr": 0.013233, "mode": "train", "time_backward": 1.053468, "time_data": 0.016910, "time_diff": 1.475729, "time_forward": 0.399987, "time_loss": 0.000261}
[03/28 03:58:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1930", "eta": "3:16:05", "loss": 0.103601, "lr": 0.013249, "mode": "train", "time_backward": 1.074913, "time_data": 0.023010, "time_diff": 1.540728, "time_forward": 0.399230, "time_loss": 0.000429}
[03/28 03:59:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1940", "eta": "3:15:37", "loss": 0.106339, "lr": 0.013266, "mode": "train", "time_backward": 1.104440, "time_data": 0.017635, "time_diff": 1.526929, "time_forward": 0.399060, "time_loss": 0.000290}
[03/28 04:00:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1950", "eta": "3:15:08", "loss": 0.101603, "lr": 0.013282, "mode": "train", "time_backward": 1.056569, "time_data": 0.016778, "time_diff": 1.475403, "time_forward": 0.398429, "time_loss": 0.000271}
[03/28 04:01:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1960", "eta": "3:15:02", "loss": 0.109673, "lr": 0.013298, "mode": "train", "time_backward": 1.888193, "time_data": 0.159262, "time_diff": 4.144487, "time_forward": 1.966913, "time_loss": 0.126251}
[03/28 04:01:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1970", "eta": "3:14:33", "loss": 0.107946, "lr": 0.013315, "mode": "train", "time_backward": 1.060998, "time_data": 0.017115, "time_diff": 1.479933, "time_forward": 0.399164, "time_loss": 0.000286}
[03/28 04:01:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1980", "eta": "3:14:20", "loss": 0.104358, "lr": 0.013331, "mode": "train", "time_backward": 2.920937, "time_data": 0.017113, "time_diff": 3.355720, "time_forward": 0.399597, "time_loss": 0.000327}
[03/28 04:02:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "1990", "eta": "3:13:52", "loss": 0.115181, "lr": 0.013347, "mode": "train", "time_backward": 1.055139, "time_data": 0.017092, "time_diff": 1.535949, "time_forward": 0.452532, "time_loss": 0.000379}
[03/28 04:02:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2000", "eta": "3:13:24", "loss": 0.109169, "lr": 0.013364, "mode": "train", "time_backward": 1.065762, "time_data": 0.016959, "time_diff": 1.498434, "time_forward": 0.408311, "time_loss": 0.000289}
[03/28 04:02:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2010", "eta": "3:14:12", "loss": 0.112132, "lr": 0.013380, "mode": "train", "time_backward": 10.423530, "time_data": 0.018118, "time_diff": 10.849755, "time_forward": 0.400857, "time_loss": 0.000359}
[03/28 04:04:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2020", "eta": "3:13:43", "loss": 0.113934, "lr": 0.013396, "mode": "train", "time_backward": 1.057244, "time_data": 0.017105, "time_diff": 1.481683, "time_forward": 0.398890, "time_loss": 0.000325}
[03/28 04:05:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2030", "eta": "3:13:34", "loss": 0.119512, "lr": 0.013412, "mode": "train", "time_backward": 3.603419, "time_data": 0.016735, "time_diff": 4.026934, "time_forward": 0.398305, "time_loss": 0.000324}
[03/28 04:05:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2040", "eta": "3:13:05", "loss": 0.085133, "lr": 0.013429, "mode": "train", "time_backward": 1.056687, "time_data": 0.018531, "time_diff": 1.481624, "time_forward": 0.400831, "time_loss": 0.000372}
[03/28 04:05:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2050", "eta": "3:12:36", "loss": 0.110093, "lr": 0.013445, "mode": "train", "time_backward": 1.059482, "time_data": 0.018293, "time_diff": 1.481787, "time_forward": 0.400212, "time_loss": 0.000370}
[03/28 04:07:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2060", "eta": "3:12:08", "loss": 0.111098, "lr": 0.013461, "mode": "train", "time_backward": 1.064401, "time_data": 0.016906, "time_diff": 1.483803, "time_forward": 0.399016, "time_loss": 0.000263}
[03/28 04:07:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2070", "eta": "3:11:39", "loss": 0.112501, "lr": 0.013478, "mode": "train", "time_backward": 1.060239, "time_data": 0.023505, "time_diff": 1.487072, "time_forward": 0.399639, "time_loss": 0.000346}
[03/28 04:08:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2080", "eta": "3:11:11", "loss": 0.118226, "lr": 0.013494, "mode": "train", "time_backward": 1.056513, "time_data": 0.018470, "time_diff": 1.527377, "time_forward": 0.448882, "time_loss": 0.000283}
[03/28 04:10:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2090", "eta": "3:18:14", "loss": 0.111050, "lr": 0.013510, "mode": "train", "time_backward": 57.316243, "time_data": 0.017487, "time_diff": 57.753576, "time_forward": 0.399864, "time_loss": 0.000313}
[03/28 04:10:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2100", "eta": "3:17:44", "loss": 0.130106, "lr": 0.013527, "mode": "train", "time_backward": 1.057569, "time_data": 0.017296, "time_diff": 1.478169, "time_forward": 0.399103, "time_loss": 0.000226}
[03/28 04:11:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2110", "eta": "3:17:14", "loss": 0.115679, "lr": 0.013543, "mode": "train", "time_backward": 1.103884, "time_data": 0.017237, "time_diff": 1.528556, "time_forward": 0.399357, "time_loss": 0.000341}
[03/28 04:11:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2120", "eta": "3:16:45", "loss": 0.117372, "lr": 0.013559, "mode": "train", "time_backward": 1.080559, "time_data": 0.017085, "time_diff": 1.504372, "time_forward": 0.398731, "time_loss": 0.000325}
[03/28 04:12:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2130", "eta": "3:16:15", "loss": 0.120576, "lr": 0.013576, "mode": "train", "time_backward": 1.096768, "time_data": 0.017445, "time_diff": 1.520929, "time_forward": 0.399684, "time_loss": 0.000304}
[03/28 04:13:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2140", "eta": "3:15:45", "loss": 0.127104, "lr": 0.013592, "mode": "train", "time_backward": 1.056404, "time_data": 0.024711, "time_diff": 1.485593, "time_forward": 0.399637, "time_loss": 0.000305}
[03/28 04:13:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2150", "eta": "3:15:15", "loss": 0.105818, "lr": 0.013608, "mode": "train", "time_backward": 1.055772, "time_data": 0.017342, "time_diff": 1.480430, "time_forward": 0.399311, "time_loss": 0.000255}
[03/28 04:13:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2160", "eta": "3:14:45", "loss": 0.106637, "lr": 0.013625, "mode": "train", "time_backward": 1.070273, "time_data": 0.017595, "time_diff": 1.502976, "time_forward": 0.399014, "time_loss": 0.000240}
[03/28 04:14:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2170", "eta": "3:17:42", "loss": 0.092668, "lr": 0.013641, "mode": "train", "time_backward": 27.280081, "time_data": 0.017914, "time_diff": 27.732078, "time_forward": 0.402744, "time_loss": 0.000272}
[03/28 04:14:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2180", "eta": "3:17:12", "loss": 0.114788, "lr": 0.013657, "mode": "train", "time_backward": 1.055274, "time_data": 0.017206, "time_diff": 1.476907, "time_forward": 0.397955, "time_loss": 0.000256}
[03/28 04:16:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2190", "eta": "3:16:41", "loss": 0.098094, "lr": 0.013674, "mode": "train", "time_backward": 1.055795, "time_data": 0.017428, "time_diff": 1.478369, "time_forward": 0.398774, "time_loss": 0.000286}
[03/28 04:16:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2200", "eta": "3:16:10", "loss": 0.106113, "lr": 0.013690, "mode": "train", "time_backward": 1.057554, "time_data": 0.018829, "time_diff": 1.484153, "time_forward": 0.400574, "time_loss": 0.000306}
[03/28 04:17:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2210", "eta": "3:15:40", "loss": 0.102550, "lr": 0.013706, "mode": "train", "time_backward": 1.054958, "time_data": 0.017567, "time_diff": 1.475230, "time_forward": 0.398876, "time_loss": 0.000360}
[03/28 04:18:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2220", "eta": "3:15:10", "loss": 0.117130, "lr": 0.013723, "mode": "train", "time_backward": 1.124012, "time_data": 0.016851, "time_diff": 1.543300, "time_forward": 0.398890, "time_loss": 0.000229}
[03/28 04:18:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2230", "eta": "3:14:40", "loss": 0.103155, "lr": 0.013739, "mode": "train", "time_backward": 1.059061, "time_data": 0.017767, "time_diff": 1.480437, "time_forward": 0.399439, "time_loss": 0.000453}
[03/28 04:19:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2240", "eta": "3:14:10", "loss": 0.103297, "lr": 0.013755, "mode": "train", "time_backward": 1.055087, "time_data": 0.017740, "time_diff": 1.496178, "time_forward": 0.398996, "time_loss": 0.000389}
[03/28 04:19:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2250", "eta": "3:17:58", "loss": 0.117219, "lr": 0.013772, "mode": "train", "time_backward": 34.494514, "time_data": 0.016855, "time_diff": 34.930842, "time_forward": 0.399119, "time_loss": 0.000238}
[03/28 04:20:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2260", "eta": "3:17:27", "loss": 0.109376, "lr": 0.013788, "mode": "train", "time_backward": 1.096906, "time_data": 0.017033, "time_diff": 1.538019, "time_forward": 0.399512, "time_loss": 0.000355}
[03/28 04:21:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2270", "eta": "3:16:56", "loss": 0.120745, "lr": 0.013804, "mode": "train", "time_backward": 1.059424, "time_data": 0.017618, "time_diff": 1.478833, "time_forward": 0.398284, "time_loss": 0.000305}
[03/28 04:21:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2280", "eta": "3:16:25", "loss": 0.104535, "lr": 0.013821, "mode": "train", "time_backward": 1.056580, "time_data": 0.019773, "time_diff": 1.483338, "time_forward": 0.398519, "time_loss": 0.000219}
[03/28 04:22:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2290", "eta": "3:15:53", "loss": 0.113513, "lr": 0.013837, "mode": "train", "time_backward": 1.057708, "time_data": 0.016903, "time_diff": 1.479749, "time_forward": 0.398724, "time_loss": 0.000266}
[03/28 04:22:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2300", "eta": "3:15:22", "loss": 0.108661, "lr": 0.013853, "mode": "train", "time_backward": 1.164828, "time_data": 0.017255, "time_diff": 1.589855, "time_forward": 0.400195, "time_loss": 0.000412}
[03/28 04:22:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2310", "eta": "3:14:51", "loss": 0.104806, "lr": 0.013870, "mode": "train", "time_backward": 1.061033, "time_data": 0.017238, "time_diff": 1.483896, "time_forward": 0.401146, "time_loss": 0.002069}
[03/28 04:23:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2320", "eta": "3:14:20", "loss": 0.106089, "lr": 0.013886, "mode": "train", "time_backward": 1.054634, "time_data": 0.017343, "time_diff": 1.483493, "time_forward": 0.399426, "time_loss": 0.000386}
[03/28 04:23:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2330", "eta": "3:13:48", "loss": 0.100670, "lr": 0.013902, "mode": "train", "time_backward": 1.056445, "time_data": 0.018008, "time_diff": 1.478398, "time_forward": 0.400421, "time_loss": 0.000375}
[03/28 04:24:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2340", "eta": "3:13:18", "loss": 0.115293, "lr": 0.013919, "mode": "train", "time_backward": 1.120243, "time_data": 0.019668, "time_diff": 1.592017, "time_forward": 0.399802, "time_loss": 0.000344}
[03/28 04:25:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2350", "eta": "3:12:47", "loss": 0.103398, "lr": 0.013935, "mode": "train", "time_backward": 1.094401, "time_data": 0.018053, "time_diff": 1.520290, "time_forward": 0.399338, "time_loss": 0.000246}
[03/28 04:25:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2360", "eta": "3:12:16", "loss": 0.093038, "lr": 0.013951, "mode": "train", "time_backward": 1.056103, "time_data": 0.017066, "time_diff": 1.477884, "time_forward": 0.398966, "time_loss": 0.000264}
[03/28 04:25:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2370", "eta": "3:11:45", "loss": 0.106172, "lr": 0.013968, "mode": "train", "time_backward": 1.083385, "time_data": 0.017511, "time_diff": 1.506958, "time_forward": 0.398825, "time_loss": 0.000386}
[03/28 04:26:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2380", "eta": "3:11:14", "loss": 0.111239, "lr": 0.013984, "mode": "train", "time_backward": 1.097281, "time_data": 0.017432, "time_diff": 1.532558, "time_forward": 0.400747, "time_loss": 0.000390}
[03/28 04:26:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2390", "eta": "3:10:43", "loss": 0.100086, "lr": 0.014000, "mode": "train", "time_backward": 1.089726, "time_data": 0.018344, "time_diff": 1.518632, "time_forward": 0.403347, "time_loss": 0.000273}
[03/28 04:26:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2400", "eta": "3:10:12", "loss": 0.098100, "lr": 0.014017, "mode": "train", "time_backward": 1.056834, "time_data": 0.017421, "time_diff": 1.480525, "time_forward": 0.399480, "time_loss": 0.000361}
[03/28 04:27:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2410", "eta": "3:09:41", "loss": 0.122711, "lr": 0.014033, "mode": "train", "time_backward": 1.055339, "time_data": 0.017745, "time_diff": 1.482653, "time_forward": 0.399194, "time_loss": 0.000334}
[03/28 04:27:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2420", "eta": "3:09:10", "loss": 0.105041, "lr": 0.014049, "mode": "train", "time_backward": 1.055567, "time_data": 0.017452, "time_diff": 1.479560, "time_forward": 0.399909, "time_loss": 0.000408}
[03/28 04:28:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2430", "eta": "3:08:39", "loss": 0.116551, "lr": 0.014066, "mode": "train", "time_backward": 1.133977, "time_data": 0.017361, "time_diff": 1.556819, "time_forward": 0.399159, "time_loss": 0.000301}
[03/28 04:29:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2440", "eta": "3:08:08", "loss": 0.104229, "lr": 0.014082, "mode": "train", "time_backward": 1.056724, "time_data": 0.016813, "time_diff": 1.483667, "time_forward": 0.396971, "time_loss": 0.000198}
[03/28 04:29:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2450", "eta": "3:07:37", "loss": 0.109247, "lr": 0.014098, "mode": "train", "time_backward": 1.054837, "time_data": 0.017794, "time_diff": 1.481379, "time_forward": 0.398499, "time_loss": 0.000236}
[03/28 04:30:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2460", "eta": "3:07:07", "loss": 0.108237, "lr": 0.014114, "mode": "train", "time_backward": 1.104720, "time_data": 0.017730, "time_diff": 1.526673, "time_forward": 0.400639, "time_loss": 0.000373}
[03/28 04:30:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2470", "eta": "3:06:36", "loss": 0.106457, "lr": 0.014131, "mode": "train", "time_backward": 1.055353, "time_data": 0.020878, "time_diff": 1.478334, "time_forward": 0.398883, "time_loss": 0.000275}
[03/28 04:31:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2480", "eta": "3:06:05", "loss": 0.121083, "lr": 0.014147, "mode": "train", "time_backward": 1.106737, "time_data": 0.020658, "time_diff": 1.540418, "time_forward": 0.408337, "time_loss": 0.000372}
[03/28 04:31:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2490", "eta": "3:05:34", "loss": 0.098540, "lr": 0.014163, "mode": "train", "time_backward": 1.055881, "time_data": 0.020427, "time_diff": 1.487921, "time_forward": 0.405973, "time_loss": 0.000468}
[03/28 04:32:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2500", "eta": "3:05:02", "loss": 0.122129, "lr": 0.014180, "mode": "train", "time_backward": 1.059229, "time_data": 0.019153, "time_diff": 1.489090, "time_forward": 0.397987, "time_loss": 0.000234}
[03/28 04:32:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2510", "eta": "3:04:31", "loss": 0.113363, "lr": 0.014196, "mode": "train", "time_backward": 1.057414, "time_data": 0.016829, "time_diff": 1.484042, "time_forward": 0.406141, "time_loss": 0.000369}
[03/28 04:33:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2520", "eta": "3:03:59", "loss": 0.106458, "lr": 0.014212, "mode": "train", "time_backward": 1.054978, "time_data": 0.017251, "time_diff": 1.478322, "time_forward": 0.399270, "time_loss": 0.000351}
[03/28 04:33:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2530", "eta": "3:03:29", "loss": 0.113224, "lr": 0.014229, "mode": "train", "time_backward": 1.096383, "time_data": 0.018515, "time_diff": 1.562167, "time_forward": 0.398955, "time_loss": 0.000356}
[03/28 04:34:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2540", "eta": "3:02:58", "loss": 0.113967, "lr": 0.014245, "mode": "train", "time_backward": 1.118309, "time_data": 0.021790, "time_diff": 1.576316, "time_forward": 0.398416, "time_loss": 0.000266}
[03/28 04:34:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2550", "eta": "3:02:35", "loss": 0.105967, "lr": 0.014261, "mode": "train", "time_backward": 1.743911, "time_data": 0.017737, "time_diff": 2.586887, "time_forward": 0.812878, "time_loss": 0.000271}
[03/28 04:34:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2560", "eta": "3:03:21", "loss": 0.104165, "lr": 0.014278, "mode": "train", "time_backward": 11.933096, "time_data": 0.016892, "time_diff": 12.379486, "time_forward": 0.400068, "time_loss": 0.000448}
[03/28 04:35:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2570", "eta": "3:02:51", "loss": 0.108053, "lr": 0.014294, "mode": "train", "time_backward": 1.110466, "time_data": 0.038712, "time_diff": 1.566087, "time_forward": 0.401327, "time_loss": 0.000407}
[03/28 04:35:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2580", "eta": "3:02:20", "loss": 0.112476, "lr": 0.014310, "mode": "train", "time_backward": 1.094082, "time_data": 0.018423, "time_diff": 1.518242, "time_forward": 0.400205, "time_loss": 0.000226}
[03/28 04:35:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2590", "eta": "3:01:51", "loss": 0.106590, "lr": 0.014327, "mode": "train", "time_backward": 1.341893, "time_data": 0.030506, "time_diff": 1.826300, "time_forward": 0.406002, "time_loss": 0.016876}
[03/28 04:36:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2600", "eta": "3:01:20", "loss": 0.120427, "lr": 0.014343, "mode": "train", "time_backward": 1.161682, "time_data": 0.017190, "time_diff": 1.579818, "time_forward": 0.398957, "time_loss": 0.000371}
[03/28 04:36:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2610", "eta": "3:00:49", "loss": 0.110037, "lr": 0.014359, "mode": "train", "time_backward": 1.055340, "time_data": 0.016927, "time_diff": 1.478989, "time_forward": 0.398467, "time_loss": 0.000248}
[03/28 04:37:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2620", "eta": "3:00:17", "loss": 0.097470, "lr": 0.014376, "mode": "train", "time_backward": 1.055637, "time_data": 0.017291, "time_diff": 1.481213, "time_forward": 0.398936, "time_loss": 0.000223}
[03/28 04:38:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2630", "eta": "2:59:46", "loss": 0.115737, "lr": 0.014392, "mode": "train", "time_backward": 1.057059, "time_data": 0.018544, "time_diff": 1.523408, "time_forward": 0.398645, "time_loss": 0.000244}
[03/28 04:38:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2640", "eta": "2:59:15", "loss": 0.119012, "lr": 0.014408, "mode": "train", "time_backward": 1.056197, "time_data": 0.017012, "time_diff": 1.551065, "time_forward": 0.473973, "time_loss": 0.000678}
[03/28 04:39:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2650", "eta": "2:58:44", "loss": 0.103539, "lr": 0.014425, "mode": "train", "time_backward": 1.057116, "time_data": 0.030824, "time_diff": 1.544965, "time_forward": 0.398123, "time_loss": 0.000233}
[03/28 04:39:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2660", "eta": "2:58:14", "loss": 0.110695, "lr": 0.014441, "mode": "train", "time_backward": 1.144800, "time_data": 0.019335, "time_diff": 1.632551, "time_forward": 0.402141, "time_loss": 0.000281}
[03/28 04:39:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2670", "eta": "2:57:42", "loss": 0.112004, "lr": 0.014457, "mode": "train", "time_backward": 1.067246, "time_data": 0.017147, "time_diff": 1.494816, "time_forward": 0.403353, "time_loss": 0.000435}
[03/28 04:40:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2680", "eta": "2:57:10", "loss": 0.096759, "lr": 0.014474, "mode": "train", "time_backward": 1.062298, "time_data": 0.017387, "time_diff": 1.485246, "time_forward": 0.402236, "time_loss": 0.000296}
[03/28 04:40:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2690", "eta": "2:56:39", "loss": 0.126000, "lr": 0.014490, "mode": "train", "time_backward": 1.057983, "time_data": 0.017813, "time_diff": 1.483363, "time_forward": 0.399015, "time_loss": 0.000287}
[03/28 04:40:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2700", "eta": "2:56:08", "loss": 0.115447, "lr": 0.014506, "mode": "train", "time_backward": 1.054407, "time_data": 0.017466, "time_diff": 1.502950, "time_forward": 0.399685, "time_loss": 0.000337}
[03/28 04:41:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2710", "eta": "2:55:38", "loss": 0.104980, "lr": 0.014523, "mode": "train", "time_backward": 1.112494, "time_data": 0.017352, "time_diff": 1.539176, "time_forward": 0.398746, "time_loss": 0.000267}
[03/28 04:41:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2720", "eta": "2:55:07", "loss": 0.110725, "lr": 0.014539, "mode": "train", "time_backward": 1.056046, "time_data": 0.018038, "time_diff": 1.479346, "time_forward": 0.398592, "time_loss": 0.000326}
[03/28 04:42:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2730", "eta": "2:56:36", "loss": 0.119121, "lr": 0.014555, "mode": "train", "time_backward": 18.997834, "time_data": 0.021493, "time_diff": 19.445464, "time_forward": 0.401093, "time_loss": 0.000287}
[03/28 04:43:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2740", "eta": "2:57:21", "loss": 0.098021, "lr": 0.014572, "mode": "train", "time_backward": 1.069367, "time_data": 11.334977, "time_diff": 12.820470, "time_forward": 0.411357, "time_loss": 0.001728}
[03/28 04:43:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2750", "eta": "2:56:49", "loss": 0.100592, "lr": 0.014588, "mode": "train", "time_backward": 1.071132, "time_data": 0.017321, "time_diff": 1.504041, "time_forward": 0.406572, "time_loss": 0.000283}
[03/28 04:43:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2760", "eta": "2:56:18", "loss": 0.106334, "lr": 0.014604, "mode": "train", "time_backward": 1.056496, "time_data": 0.017195, "time_diff": 1.478758, "time_forward": 0.400718, "time_loss": 0.000804}
[03/28 04:44:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2770", "eta": "2:55:46", "loss": 0.101513, "lr": 0.014621, "mode": "train", "time_backward": 1.071928, "time_data": 0.033226, "time_diff": 1.575607, "time_forward": 0.400658, "time_loss": 0.000378}
[03/28 04:44:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2780", "eta": "2:55:15", "loss": 0.105997, "lr": 0.014637, "mode": "train", "time_backward": 1.118129, "time_data": 0.017966, "time_diff": 1.541404, "time_forward": 0.398956, "time_loss": 0.000257}
[03/28 04:45:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2790", "eta": "2:55:00", "loss": 0.096394, "lr": 0.014653, "mode": "train", "time_backward": 1.069633, "time_data": 2.588006, "time_diff": 4.087189, "time_forward": 0.422998, "time_loss": 0.003312}
[03/28 04:45:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2800", "eta": "2:54:27", "loss": 0.103571, "lr": 0.014670, "mode": "train", "time_backward": 1.056395, "time_data": 0.017293, "time_diff": 1.480435, "time_forward": 0.403577, "time_loss": 0.000428}
[03/28 04:46:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2810", "eta": "2:54:19", "loss": 0.107791, "lr": 0.014686, "mode": "train", "time_backward": 4.510745, "time_data": 0.022077, "time_diff": 4.959319, "time_forward": 0.422662, "time_loss": 0.000442}
[03/28 04:46:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2820", "eta": "2:53:47", "loss": 0.107012, "lr": 0.014702, "mode": "train", "time_backward": 1.055044, "time_data": 0.017281, "time_diff": 1.478141, "time_forward": 0.402094, "time_loss": 0.000532}
[03/28 04:46:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2830", "eta": "2:53:16", "loss": 0.096888, "lr": 0.014719, "mode": "train", "time_backward": 1.102146, "time_data": 0.016803, "time_diff": 1.520833, "time_forward": 0.398283, "time_loss": 0.000327}
[03/28 04:46:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2840", "eta": "2:52:44", "loss": 0.095443, "lr": 0.014735, "mode": "train", "time_backward": 1.055704, "time_data": 0.016992, "time_diff": 1.479177, "time_forward": 0.399699, "time_loss": 0.000269}
[03/28 04:47:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2850", "eta": "2:52:12", "loss": 0.119379, "lr": 0.014751, "mode": "train", "time_backward": 1.079856, "time_data": 0.018910, "time_diff": 1.536010, "time_forward": 0.401264, "time_loss": 0.000374}
[03/28 04:48:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2860", "eta": "2:51:41", "loss": 0.098940, "lr": 0.014768, "mode": "train", "time_backward": 1.055840, "time_data": 0.018229, "time_diff": 1.478091, "time_forward": 0.400041, "time_loss": 0.000427}
[03/28 04:48:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2870", "eta": "2:53:33", "loss": 0.117399, "lr": 0.014784, "mode": "train", "time_backward": 23.387100, "time_data": 0.016633, "time_diff": 23.809819, "time_forward": 0.398480, "time_loss": 0.000308}
[03/28 04:49:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2880", "eta": "2:53:01", "loss": 0.109482, "lr": 0.014800, "mode": "train", "time_backward": 1.054450, "time_data": 0.016769, "time_diff": 1.473363, "time_forward": 0.398504, "time_loss": 0.000225}
[03/28 04:50:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2890", "eta": "2:56:43", "loss": 0.092747, "lr": 0.014816, "mode": "train", "time_backward": 40.553952, "time_data": 0.017314, "time_diff": 40.973580, "time_forward": 0.399889, "time_loss": 0.000378}
[03/28 04:50:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2900", "eta": "2:56:10", "loss": 0.091739, "lr": 0.014833, "mode": "train", "time_backward": 1.068943, "time_data": 0.017876, "time_diff": 1.490079, "time_forward": 0.399575, "time_loss": 0.000414}
[03/28 04:51:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2910", "eta": "2:55:37", "loss": 0.112641, "lr": 0.014849, "mode": "train", "time_backward": 1.055027, "time_data": 0.018647, "time_diff": 1.488743, "time_forward": 0.411490, "time_loss": 0.000368}
[03/28 04:51:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2920", "eta": "2:55:04", "loss": 0.097316, "lr": 0.014865, "mode": "train", "time_backward": 1.103581, "time_data": 0.018039, "time_diff": 1.561385, "time_forward": 0.399124, "time_loss": 0.000223}
[03/28 04:52:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2930", "eta": "2:54:31", "loss": 0.103094, "lr": 0.014882, "mode": "train", "time_backward": 1.065384, "time_data": 0.016786, "time_diff": 1.506735, "time_forward": 0.399505, "time_loss": 0.000247}
[03/28 04:52:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2940", "eta": "2:53:58", "loss": 0.106206, "lr": 0.014898, "mode": "train", "time_backward": 1.072107, "time_data": 0.017427, "time_diff": 1.494258, "time_forward": 0.399926, "time_loss": 0.000349}
[03/28 04:52:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2950", "eta": "2:53:25", "loss": 0.111692, "lr": 0.014914, "mode": "train", "time_backward": 1.119535, "time_data": 0.016704, "time_diff": 1.543876, "time_forward": 0.399369, "time_loss": 0.000813}
[03/28 04:52:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2960", "eta": "2:52:52", "loss": 0.098571, "lr": 0.014931, "mode": "train", "time_backward": 1.063863, "time_data": 0.017333, "time_diff": 1.507716, "time_forward": 0.401191, "time_loss": 0.000334}
[03/28 04:54:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2970", "eta": "2:59:15", "loss": 0.112107, "lr": 0.014947, "mode": "train", "time_backward": 67.400485, "time_data": 0.017713, "time_diff": 67.827611, "time_forward": 0.400372, "time_loss": 0.000385}
[03/28 04:54:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2980", "eta": "2:58:00", "loss": 0.109086, "lr": 0.014963, "mode": "train", "time_backward": 1.054920, "time_data": 0.019666, "time_diff": 1.525259, "time_forward": 0.446929, "time_loss": 0.000428}
[03/28 04:54:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "2990", "eta": "2:57:26", "loss": 0.103441, "lr": 0.014980, "mode": "train", "time_backward": 1.092770, "time_data": 0.016698, "time_diff": 1.527142, "time_forward": 0.398479, "time_loss": 0.000315}
[03/28 04:55:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3000", "eta": "2:56:53", "loss": 0.120923, "lr": 0.014996, "mode": "train", "time_backward": 1.053059, "time_data": 0.027604, "time_diff": 1.577037, "time_forward": 0.492719, "time_loss": 0.000289}
[03/28 04:55:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3010", "eta": "2:56:18", "loss": 0.104461, "lr": 0.015012, "mode": "train", "time_backward": 1.077818, "time_data": 0.019599, "time_diff": 1.576619, "time_forward": 0.475565, "time_loss": 0.000325}
[03/28 04:55:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3020", "eta": "2:55:44", "loss": 0.107425, "lr": 0.015029, "mode": "train", "time_backward": 1.065418, "time_data": 0.020383, "time_diff": 1.487897, "time_forward": 0.398533, "time_loss": 0.000229}
[03/28 04:56:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3030", "eta": "2:55:10", "loss": 0.100789, "lr": 0.015045, "mode": "train", "time_backward": 1.054838, "time_data": 0.016862, "time_diff": 1.480588, "time_forward": 0.400104, "time_loss": 0.000288}
[03/28 04:56:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3040", "eta": "2:54:34", "loss": 0.106031, "lr": 0.015061, "mode": "train", "time_backward": 1.061890, "time_data": 0.017714, "time_diff": 1.492599, "time_forward": 0.401449, "time_loss": 0.000245}
[03/28 04:56:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3050", "eta": "2:55:20", "loss": 0.090152, "lr": 0.015078, "mode": "train", "time_backward": 14.087373, "time_data": 0.016993, "time_diff": 14.621298, "time_forward": 0.400706, "time_loss": 0.000415}
[03/28 04:57:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3060", "eta": "2:54:46", "loss": 0.102949, "lr": 0.015094, "mode": "train", "time_backward": 1.056566, "time_data": 0.017188, "time_diff": 1.480711, "time_forward": 0.400137, "time_loss": 0.000321}
[03/28 04:57:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3070", "eta": "2:54:12", "loss": 0.099931, "lr": 0.015110, "mode": "train", "time_backward": 1.074838, "time_data": 0.019682, "time_diff": 1.525776, "time_forward": 0.407217, "time_loss": 0.000288}
[03/28 04:57:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3080", "eta": "2:53:38", "loss": 0.104879, "lr": 0.015127, "mode": "train", "time_backward": 1.140468, "time_data": 0.017934, "time_diff": 1.566087, "time_forward": 0.405276, "time_loss": 0.000393}
[03/28 04:58:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3090", "eta": "2:53:24", "loss": 0.101486, "lr": 0.015143, "mode": "train", "time_backward": 1.072707, "time_data": 3.402344, "time_diff": 4.940971, "time_forward": 0.462381, "time_loss": 0.000258}
[03/28 04:59:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3100", "eta": "2:52:50", "loss": 0.104235, "lr": 0.015159, "mode": "train", "time_backward": 1.060598, "time_data": 0.020871, "time_diff": 1.489945, "time_forward": 0.400187, "time_loss": 0.000361}
[03/28 04:59:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3110", "eta": "2:52:16", "loss": 0.094895, "lr": 0.015176, "mode": "train", "time_backward": 1.119788, "time_data": 0.018000, "time_diff": 1.558475, "time_forward": 0.401970, "time_loss": 0.000283}
[03/28 04:59:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3120", "eta": "2:51:42", "loss": 0.095320, "lr": 0.015192, "mode": "train", "time_backward": 1.058930, "time_data": 0.017226, "time_diff": 1.536902, "time_forward": 0.399655, "time_loss": 0.000357}
[03/28 05:00:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3130", "eta": "2:52:05", "loss": 0.124354, "lr": 0.015208, "mode": "train", "time_backward": 10.651704, "time_data": 0.017077, "time_diff": 11.083154, "time_forward": 0.398859, "time_loss": 0.000258}
[03/28 05:00:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3140", "eta": "2:51:30", "loss": 0.123275, "lr": 0.015225, "mode": "train", "time_backward": 1.066863, "time_data": 0.016622, "time_diff": 1.485029, "time_forward": 0.398122, "time_loss": 0.000265}
[03/28 05:01:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3150", "eta": "2:50:56", "loss": 0.099640, "lr": 0.015241, "mode": "train", "time_backward": 1.054527, "time_data": 0.024473, "time_diff": 1.493139, "time_forward": 0.398772, "time_loss": 0.004183}
[03/28 05:01:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3160", "eta": "2:50:21", "loss": 0.099301, "lr": 0.015257, "mode": "train", "time_backward": 1.078554, "time_data": 0.017057, "time_diff": 1.536246, "time_forward": 0.399154, "time_loss": 0.000255}
[03/28 05:01:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3170", "eta": "2:49:26", "loss": 0.108452, "lr": 0.015274, "mode": "train", "time_backward": 1.056600, "time_data": 0.018455, "time_diff": 1.481386, "time_forward": 0.399716, "time_loss": 0.000347}
[03/28 05:02:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3180", "eta": "2:48:51", "loss": 0.105929, "lr": 0.015290, "mode": "train", "time_backward": 1.058484, "time_data": 0.022824, "time_diff": 1.492866, "time_forward": 0.400866, "time_loss": 0.000849}
[03/28 05:03:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3190", "eta": "2:48:16", "loss": 0.112622, "lr": 0.015306, "mode": "train", "time_backward": 1.054547, "time_data": 0.017586, "time_diff": 1.481168, "time_forward": 0.399378, "time_loss": 0.000263}
[03/28 05:03:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3200", "eta": "2:47:43", "loss": 0.102464, "lr": 0.015323, "mode": "train", "time_backward": 1.207342, "time_data": 0.022355, "time_diff": 1.643117, "time_forward": 0.401965, "time_loss": 0.000254}
[03/28 05:03:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3210", "eta": "2:47:40", "loss": 0.095757, "lr": 0.015339, "mode": "train", "time_backward": 6.404520, "time_data": 0.016844, "time_diff": 6.883463, "time_forward": 0.454658, "time_loss": 0.000356}
[03/28 05:04:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3220", "eta": "2:47:05", "loss": 0.101870, "lr": 0.015355, "mode": "train", "time_backward": 1.078005, "time_data": 0.026837, "time_diff": 1.529704, "time_forward": 0.399319, "time_loss": 0.000239}
[03/28 05:04:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3230", "eta": "2:46:30", "loss": 0.084020, "lr": 0.015372, "mode": "train", "time_backward": 1.068113, "time_data": 0.018453, "time_diff": 1.493181, "time_forward": 0.399084, "time_loss": 0.000251}
[03/28 05:04:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3240", "eta": "2:45:56", "loss": 0.114606, "lr": 0.015388, "mode": "train", "time_backward": 1.055271, "time_data": 0.017417, "time_diff": 1.518881, "time_forward": 0.400237, "time_loss": 0.000307}
[03/28 05:05:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3250", "eta": "2:44:30", "loss": 0.096694, "lr": 0.015404, "mode": "train", "time_backward": 1.195671, "time_data": 0.017446, "time_diff": 1.705705, "time_forward": 0.399012, "time_loss": 0.000247}
[03/28 05:05:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3260", "eta": "2:43:55", "loss": 0.117154, "lr": 0.015421, "mode": "train", "time_backward": 1.054909, "time_data": 0.017015, "time_diff": 1.479476, "time_forward": 0.398325, "time_loss": 0.000303}
[03/28 05:05:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3270", "eta": "2:43:20", "loss": 0.105571, "lr": 0.015437, "mode": "train", "time_backward": 1.053949, "time_data": 0.016867, "time_diff": 1.474593, "time_forward": 0.400207, "time_loss": 0.000293}
[03/28 05:07:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3280", "eta": "2:42:46", "loss": 0.095619, "lr": 0.015453, "mode": "train", "time_backward": 1.067867, "time_data": 0.016870, "time_diff": 1.492323, "time_forward": 0.398707, "time_loss": 0.000234}
[03/28 05:07:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3290", "eta": "2:42:12", "loss": 0.101442, "lr": 0.015470, "mode": "train", "time_backward": 1.096168, "time_data": 0.017103, "time_diff": 1.516358, "time_forward": 0.399421, "time_loss": 0.000338}
[03/28 05:07:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3300", "eta": "2:41:37", "loss": 0.098208, "lr": 0.015486, "mode": "train", "time_backward": 1.054504, "time_data": 0.017424, "time_diff": 1.481994, "time_forward": 0.398900, "time_loss": 0.000220}
[03/28 05:08:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3310", "eta": "2:41:02", "loss": 0.101305, "lr": 0.015502, "mode": "train", "time_backward": 1.052818, "time_data": 0.016784, "time_diff": 1.471863, "time_forward": 0.398130, "time_loss": 0.000297}
[03/28 05:08:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3320", "eta": "2:40:28", "loss": 0.103942, "lr": 0.015518, "mode": "train", "time_backward": 1.179772, "time_data": 0.017105, "time_diff": 1.612571, "time_forward": 0.408067, "time_loss": 0.000397}
[03/28 05:09:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3330", "eta": "2:40:48", "loss": 0.113556, "lr": 0.015535, "mode": "train", "time_backward": 1.060038, "time_data": 9.706885, "time_diff": 11.184521, "time_forward": 0.413865, "time_loss": 0.000446}
[03/28 05:09:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3340", "eta": "2:40:13", "loss": 0.103975, "lr": 0.015551, "mode": "train", "time_backward": 1.070235, "time_data": 0.021562, "time_diff": 1.500737, "time_forward": 0.399910, "time_loss": 0.000429}
[03/28 05:10:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3350", "eta": "2:39:39", "loss": 0.122908, "lr": 0.015567, "mode": "train", "time_backward": 1.069653, "time_data": 0.017223, "time_diff": 1.490856, "time_forward": 0.400423, "time_loss": 0.000290}
[03/28 05:10:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3360", "eta": "2:39:04", "loss": 0.100158, "lr": 0.015584, "mode": "train", "time_backward": 1.056642, "time_data": 0.016921, "time_diff": 1.482931, "time_forward": 0.401821, "time_loss": 0.000391}
[03/28 05:10:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3370", "eta": "2:38:29", "loss": 0.096088, "lr": 0.015600, "mode": "train", "time_backward": 1.054194, "time_data": 0.037073, "time_diff": 1.497128, "time_forward": 0.398560, "time_loss": 0.000241}
[03/28 05:11:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3380", "eta": "2:37:55", "loss": 0.094522, "lr": 0.015616, "mode": "train", "time_backward": 1.061965, "time_data": 0.017024, "time_diff": 1.524121, "time_forward": 0.438269, "time_loss": 0.000471}
[03/28 05:11:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3390", "eta": "2:34:32", "loss": 0.124070, "lr": 0.015633, "mode": "train", "time_backward": 1.089450, "time_data": 0.017360, "time_diff": 1.512488, "time_forward": 0.398047, "time_loss": 0.000317}
[03/28 05:11:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3400", "eta": "2:33:59", "loss": 0.103149, "lr": 0.015649, "mode": "train", "time_backward": 1.152152, "time_data": 0.017233, "time_diff": 1.664367, "time_forward": 0.400390, "time_loss": 0.000404}
[03/28 05:12:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3410", "eta": "2:33:25", "loss": 0.119461, "lr": 0.015665, "mode": "train", "time_backward": 1.056757, "time_data": 0.019429, "time_diff": 1.482775, "time_forward": 0.400464, "time_loss": 0.000393}
[03/28 05:12:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3420", "eta": "2:32:51", "loss": 0.102687, "lr": 0.015682, "mode": "train", "time_backward": 1.060934, "time_data": 0.020852, "time_diff": 1.517164, "time_forward": 0.416621, "time_loss": 0.000346}
[03/28 05:12:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3430", "eta": "2:32:17", "loss": 0.105432, "lr": 0.015698, "mode": "train", "time_backward": 1.143652, "time_data": 0.018926, "time_diff": 1.624787, "time_forward": 0.399277, "time_loss": 0.000336}
[03/28 05:13:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3440", "eta": "2:31:43", "loss": 0.090490, "lr": 0.015714, "mode": "train", "time_backward": 1.065043, "time_data": 0.020924, "time_diff": 1.497277, "time_forward": 0.408974, "time_loss": 0.000305}
[03/28 05:13:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3450", "eta": "2:31:10", "loss": 0.118825, "lr": 0.015731, "mode": "train", "time_backward": 1.086975, "time_data": 0.031482, "time_diff": 1.526533, "time_forward": 0.401037, "time_loss": 0.000393}
[03/28 05:13:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3460", "eta": "2:31:50", "loss": 0.105463, "lr": 0.015747, "mode": "train", "time_backward": 1.057816, "time_data": 14.136451, "time_diff": 15.666265, "time_forward": 0.468202, "time_loss": 0.000426}
[03/28 05:14:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3470", "eta": "2:24:04", "loss": 0.093444, "lr": 0.015763, "mode": "train", "time_backward": 1.145442, "time_data": 0.021272, "time_diff": 1.586300, "time_forward": 0.415384, "time_loss": 0.000410}
[03/28 05:14:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3480", "eta": "2:23:32", "loss": 0.094322, "lr": 0.015780, "mode": "train", "time_backward": 1.074168, "time_data": 0.021130, "time_diff": 1.583044, "time_forward": 0.480047, "time_loss": 0.000292}
[03/28 05:14:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3490", "eta": "2:22:59", "loss": 0.115959, "lr": 0.015796, "mode": "train", "time_backward": 1.059886, "time_data": 0.027266, "time_diff": 1.504888, "time_forward": 0.407058, "time_loss": 0.000393}
[03/28 05:15:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3500", "eta": "2:22:27", "loss": 0.118405, "lr": 0.015812, "mode": "train", "time_backward": 1.116062, "time_data": 0.017364, "time_diff": 1.582484, "time_forward": 0.399124, "time_loss": 0.000292}
[03/28 05:15:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3510", "eta": "2:21:55", "loss": 0.098883, "lr": 0.015829, "mode": "train", "time_backward": 1.054525, "time_data": 0.019145, "time_diff": 1.482675, "time_forward": 0.405590, "time_loss": 0.000372}
[03/28 05:15:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3520", "eta": "2:21:22", "loss": 0.105232, "lr": 0.015845, "mode": "train", "time_backward": 1.127156, "time_data": 0.017590, "time_diff": 1.547416, "time_forward": 0.399799, "time_loss": 0.000417}
[03/28 05:15:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3530", "eta": "2:20:49", "loss": 0.091188, "lr": 0.015861, "mode": "train", "time_backward": 1.059906, "time_data": 0.017391, "time_diff": 1.486535, "time_forward": 0.401578, "time_loss": 0.000343}
[03/28 05:16:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3540", "eta": "2:20:46", "loss": 0.107979, "lr": 0.015878, "mode": "train", "time_backward": 1.053550, "time_data": 5.556343, "time_diff": 7.195688, "time_forward": 0.441884, "time_loss": 0.000348}
[03/28 05:16:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3550", "eta": "2:17:31", "loss": 0.110061, "lr": 0.015894, "mode": "train", "time_backward": 1.086618, "time_data": 0.016835, "time_diff": 1.546223, "time_forward": 0.398750, "time_loss": 0.000302}
[03/28 05:17:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3560", "eta": "2:16:58", "loss": 0.106738, "lr": 0.015910, "mode": "train", "time_backward": 1.172991, "time_data": 0.019070, "time_diff": 1.709788, "time_forward": 0.410561, "time_loss": 0.000262}
[03/28 05:17:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3570", "eta": "2:16:26", "loss": 0.083376, "lr": 0.015927, "mode": "train", "time_backward": 1.066789, "time_data": 0.021892, "time_diff": 1.548701, "time_forward": 0.403247, "time_loss": 0.000231}
[03/28 05:17:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3580", "eta": "2:13:48", "loss": 0.100599, "lr": 0.015943, "mode": "train", "time_backward": 1.083927, "time_data": 0.019190, "time_diff": 1.532684, "time_forward": 0.417958, "time_loss": 0.000257}
[03/28 05:17:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3590", "eta": "2:13:17", "loss": 0.104988, "lr": 0.015959, "mode": "train", "time_backward": 1.060922, "time_data": 0.016542, "time_diff": 1.486294, "time_forward": 0.399669, "time_loss": 0.000231}
[03/28 05:18:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3600", "eta": "2:12:45", "loss": 0.111047, "lr": 0.015976, "mode": "train", "time_backward": 1.067367, "time_data": 0.017466, "time_diff": 1.510157, "time_forward": 0.398849, "time_loss": 0.000312}
[03/28 05:18:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3610", "eta": "2:12:10", "loss": 0.109534, "lr": 0.015992, "mode": "train", "time_backward": 1.056925, "time_data": 0.017475, "time_diff": 1.480334, "time_forward": 0.399232, "time_loss": 0.000364}
[03/28 05:19:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3620", "eta": "2:11:52", "loss": 0.089324, "lr": 0.016008, "mode": "train", "time_backward": 1.087702, "time_data": 2.803032, "time_diff": 4.302144, "time_forward": 0.408114, "time_loss": 0.000395}
[03/28 05:19:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3630", "eta": "2:10:11", "loss": 0.105772, "lr": 0.016025, "mode": "train", "time_backward": 1.055825, "time_data": 0.018299, "time_diff": 1.479556, "time_forward": 0.399516, "time_loss": 0.000274}
[03/28 05:19:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3640", "eta": "2:09:39", "loss": 0.107248, "lr": 0.016041, "mode": "train", "time_backward": 1.060089, "time_data": 0.017062, "time_diff": 1.495013, "time_forward": 0.414583, "time_loss": 0.000633}
[03/28 05:20:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3650", "eta": "2:09:08", "loss": 0.113249, "lr": 0.016057, "mode": "train", "time_backward": 1.079451, "time_data": 0.022813, "time_diff": 1.518581, "time_forward": 0.400671, "time_loss": 0.000235}
[03/28 05:20:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3660", "eta": "2:08:37", "loss": 0.110619, "lr": 0.016074, "mode": "train", "time_backward": 1.053763, "time_data": 0.016960, "time_diff": 1.606585, "time_forward": 0.417955, "time_loss": 0.000352}
[03/28 05:20:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3670", "eta": "2:08:06", "loss": 0.109931, "lr": 0.016090, "mode": "train", "time_backward": 1.054613, "time_data": 0.017507, "time_diff": 1.594992, "time_forward": 0.475284, "time_loss": 0.000651}
[03/28 05:21:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3680", "eta": "2:07:34", "loss": 0.114185, "lr": 0.016106, "mode": "train", "time_backward": 1.139834, "time_data": 0.017254, "time_diff": 1.582090, "time_forward": 0.411536, "time_loss": 0.000274}
[03/28 05:21:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3690", "eta": "2:07:03", "loss": 0.102861, "lr": 0.016123, "mode": "train", "time_backward": 1.099975, "time_data": 0.017157, "time_diff": 1.544835, "time_forward": 0.415597, "time_loss": 0.000288}
[03/28 05:21:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3700", "eta": "2:06:31", "loss": 0.109378, "lr": 0.016139, "mode": "train", "time_backward": 1.060180, "time_data": 0.017564, "time_diff": 1.485759, "time_forward": 0.403804, "time_loss": 0.000250}
[03/28 05:22:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3710", "eta": "2:03:50", "loss": 0.099062, "lr": 0.016155, "mode": "train", "time_backward": 1.100352, "time_data": 0.017617, "time_diff": 1.548675, "time_forward": 0.425355, "time_loss": 0.000335}
[03/28 05:22:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3720", "eta": "2:03:19", "loss": 0.098728, "lr": 0.016172, "mode": "train", "time_backward": 1.167915, "time_data": 0.017421, "time_diff": 1.595478, "time_forward": 0.399045, "time_loss": 0.000348}
[03/28 05:22:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3730", "eta": "2:02:48", "loss": 0.095952, "lr": 0.016188, "mode": "train", "time_backward": 1.065965, "time_data": 0.032731, "time_diff": 1.523250, "time_forward": 0.416882, "time_loss": 0.000352}
[03/28 05:22:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3740", "eta": "2:02:17", "loss": 0.102661, "lr": 0.016204, "mode": "train", "time_backward": 1.056589, "time_data": 0.018219, "time_diff": 1.482398, "time_forward": 0.400018, "time_loss": 0.000361}
[03/28 05:23:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3750", "eta": "2:01:46", "loss": 0.100118, "lr": 0.016221, "mode": "train", "time_backward": 1.054974, "time_data": 0.017685, "time_diff": 1.588105, "time_forward": 0.507741, "time_loss": 0.000431}
[03/28 05:23:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3760", "eta": "2:01:16", "loss": 0.085323, "lr": 0.016237, "mode": "train", "time_backward": 1.137898, "time_data": 0.020662, "time_diff": 1.636151, "time_forward": 0.425000, "time_loss": 0.000345}
[03/28 05:23:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3770", "eta": "2:00:45", "loss": 0.100798, "lr": 0.016253, "mode": "train", "time_backward": 1.080576, "time_data": 0.027030, "time_diff": 1.527372, "time_forward": 0.405070, "time_loss": 0.000390}
[03/28 05:23:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3780", "eta": "2:00:15", "loss": 0.096729, "lr": 0.016269, "mode": "train", "time_backward": 1.079341, "time_data": 0.115288, "time_diff": 1.681536, "time_forward": 0.418983, "time_loss": 0.000304}
[03/28 05:24:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3790", "eta": "1:59:45", "loss": 0.091066, "lr": 0.016286, "mode": "train", "time_backward": 1.174344, "time_data": 0.016983, "time_diff": 1.600977, "time_forward": 0.406027, "time_loss": 0.000353}
[03/28 05:24:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3800", "eta": "1:59:14", "loss": 0.112867, "lr": 0.016302, "mode": "train", "time_backward": 1.054547, "time_data": 0.017396, "time_diff": 1.480682, "time_forward": 0.399256, "time_loss": 0.000311}
[03/28 05:24:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3810", "eta": "1:58:43", "loss": 0.090841, "lr": 0.016318, "mode": "train", "time_backward": 1.054420, "time_data": 0.018159, "time_diff": 1.610596, "time_forward": 0.534248, "time_loss": 0.000404}
[03/28 05:25:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3820", "eta": "1:58:12", "loss": 0.107068, "lr": 0.016335, "mode": "train", "time_backward": 1.080154, "time_data": 0.040983, "time_diff": 1.562450, "time_forward": 0.415523, "time_loss": 0.000236}
[03/28 05:25:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3830", "eta": "1:57:41", "loss": 0.090789, "lr": 0.016351, "mode": "train", "time_backward": 1.116292, "time_data": 0.018459, "time_diff": 1.540667, "time_forward": 0.398385, "time_loss": 0.000260}
[03/28 05:25:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3840", "eta": "1:57:10", "loss": 0.098225, "lr": 0.016367, "mode": "train", "time_backward": 1.073552, "time_data": 0.036626, "time_diff": 1.539733, "time_forward": 0.408932, "time_loss": 0.000254}
[03/28 05:25:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3850", "eta": "1:56:40", "loss": 0.104412, "lr": 0.016384, "mode": "train", "time_backward": 1.267394, "time_data": 0.017225, "time_diff": 1.692024, "time_forward": 0.401348, "time_loss": 0.000388}
[03/28 05:26:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3860", "eta": "1:56:09", "loss": 0.094145, "lr": 0.016400, "mode": "train", "time_backward": 1.059984, "time_data": 0.019171, "time_diff": 1.482337, "time_forward": 0.400454, "time_loss": 0.000249}
[03/28 05:26:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3870", "eta": "1:55:38", "loss": 0.080840, "lr": 0.016416, "mode": "train", "time_backward": 1.064907, "time_data": 0.017062, "time_diff": 1.488059, "time_forward": 0.402792, "time_loss": 0.000427}
[03/28 05:26:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3880", "eta": "1:55:08", "loss": 0.085534, "lr": 0.016433, "mode": "train", "time_backward": 1.057893, "time_data": 0.019982, "time_diff": 1.685543, "time_forward": 0.398263, "time_loss": 0.000306}
[03/28 05:27:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3890", "eta": "1:54:37", "loss": 0.107564, "lr": 0.016449, "mode": "train", "time_backward": 1.086852, "time_data": 0.016667, "time_diff": 1.505442, "time_forward": 0.398340, "time_loss": 0.000221}
[03/28 05:27:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3900", "eta": "1:54:06", "loss": 0.098244, "lr": 0.016465, "mode": "train", "time_backward": 1.092318, "time_data": 0.017843, "time_diff": 1.527221, "time_forward": 0.413573, "time_loss": 0.000284}
[03/28 05:27:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3910", "eta": "1:53:35", "loss": 0.098119, "lr": 0.016482, "mode": "train", "time_backward": 1.057190, "time_data": 0.017314, "time_diff": 1.480079, "time_forward": 0.397800, "time_loss": 0.000194}
[03/28 05:27:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3920", "eta": "1:53:04", "loss": 0.101711, "lr": 0.016498, "mode": "train", "time_backward": 1.053303, "time_data": 0.017934, "time_diff": 1.514428, "time_forward": 0.439595, "time_loss": 0.000285}
[03/28 05:28:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3930", "eta": "1:52:33", "loss": 0.097149, "lr": 0.016514, "mode": "train", "time_backward": 1.091844, "time_data": 0.017453, "time_diff": 1.523468, "time_forward": 0.402554, "time_loss": 0.000264}
[03/28 05:28:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3940", "eta": "1:52:02", "loss": 0.094730, "lr": 0.016531, "mode": "train", "time_backward": 1.055847, "time_data": 0.017182, "time_diff": 1.481773, "time_forward": 0.400050, "time_loss": 0.000274}
[03/28 05:28:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3950", "eta": "1:51:31", "loss": 0.095760, "lr": 0.016547, "mode": "train", "time_backward": 1.064281, "time_data": 0.018161, "time_diff": 1.529143, "time_forward": 0.444625, "time_loss": 0.000422}
[03/28 05:29:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3960", "eta": "1:51:00", "loss": 0.101013, "lr": 0.016563, "mode": "train", "time_backward": 1.056488, "time_data": 0.017886, "time_diff": 1.477003, "time_forward": 0.398596, "time_loss": 0.000923}
[03/28 05:30:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3970", "eta": "1:50:28", "loss": 0.090570, "lr": 0.016580, "mode": "train", "time_backward": 1.071531, "time_data": 0.016836, "time_diff": 1.525573, "time_forward": 0.398195, "time_loss": 0.000248}
[03/28 05:30:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3980", "eta": "1:49:57", "loss": 0.106106, "lr": 0.016596, "mode": "train", "time_backward": 1.076696, "time_data": 0.019022, "time_diff": 1.507487, "time_forward": 0.400095, "time_loss": 0.000406}
[03/28 05:30:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "3990", "eta": "1:49:26", "loss": 0.089598, "lr": 0.016612, "mode": "train", "time_backward": 1.116322, "time_data": 0.017007, "time_diff": 1.539782, "time_forward": 0.401292, "time_loss": 0.000610}
[03/28 05:31:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4000", "eta": "1:48:56", "loss": 0.096833, "lr": 0.016629, "mode": "train", "time_backward": 1.125344, "time_data": 0.017359, "time_diff": 1.572701, "time_forward": 0.418528, "time_loss": 0.000261}
[03/28 05:31:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4010", "eta": "1:48:24", "loss": 0.117905, "lr": 0.016645, "mode": "train", "time_backward": 1.060294, "time_data": 0.017024, "time_diff": 1.486032, "time_forward": 0.400371, "time_loss": 0.000349}
[03/28 05:31:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4020", "eta": "1:47:53", "loss": 0.104398, "lr": 0.016661, "mode": "train", "time_backward": 1.122520, "time_data": 0.017454, "time_diff": 1.547364, "time_forward": 0.398447, "time_loss": 0.000313}
[03/28 05:32:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4030", "eta": "1:47:21", "loss": 0.095534, "lr": 0.016678, "mode": "train", "time_backward": 1.057375, "time_data": 0.016931, "time_diff": 1.517077, "time_forward": 0.399820, "time_loss": 0.000351}
[03/28 05:32:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4040", "eta": "1:46:50", "loss": 0.099063, "lr": 0.016694, "mode": "train", "time_backward": 1.114288, "time_data": 0.024981, "time_diff": 1.549966, "time_forward": 0.398602, "time_loss": 0.000228}
[03/28 05:32:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4050", "eta": "1:46:19", "loss": 0.101824, "lr": 0.016710, "mode": "train", "time_backward": 1.090181, "time_data": 0.019763, "time_diff": 1.516567, "time_forward": 0.401071, "time_loss": 0.000260}
[03/28 05:33:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4060", "eta": "1:45:48", "loss": 0.097376, "lr": 0.016727, "mode": "train", "time_backward": 1.164435, "time_data": 0.018871, "time_diff": 1.607463, "time_forward": 0.400468, "time_loss": 0.000295}
[03/28 05:33:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4070", "eta": "1:45:18", "loss": 0.102200, "lr": 0.016743, "mode": "train", "time_backward": 1.082914, "time_data": 0.017801, "time_diff": 1.568836, "time_forward": 0.398693, "time_loss": 0.000313}
[03/28 05:33:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4080", "eta": "1:44:47", "loss": 0.108107, "lr": 0.016759, "mode": "train", "time_backward": 1.084645, "time_data": 0.018337, "time_diff": 1.534035, "time_forward": 0.397976, "time_loss": 0.000218}
[03/28 05:34:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4090", "eta": "1:44:16", "loss": 0.093470, "lr": 0.016776, "mode": "train", "time_backward": 1.094294, "time_data": 0.016781, "time_diff": 1.621963, "time_forward": 0.397900, "time_loss": 0.000320}
[03/28 05:34:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4100", "eta": "1:43:45", "loss": 0.115959, "lr": 0.016792, "mode": "train", "time_backward": 1.073302, "time_data": 0.018515, "time_diff": 1.506504, "time_forward": 0.402135, "time_loss": 0.000339}
[03/28 05:34:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4110", "eta": "1:43:14", "loss": 0.099364, "lr": 0.016808, "mode": "train", "time_backward": 1.099873, "time_data": 0.016901, "time_diff": 1.523286, "time_forward": 0.399518, "time_loss": 0.000271}
[03/28 05:34:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4120", "eta": "1:42:43", "loss": 0.104563, "lr": 0.016825, "mode": "train", "time_backward": 1.128386, "time_data": 0.017051, "time_diff": 1.560672, "time_forward": 0.406634, "time_loss": 0.000425}
[03/28 05:35:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4130", "eta": "1:42:13", "loss": 0.104781, "lr": 0.016841, "mode": "train", "time_backward": 1.053213, "time_data": 0.025035, "time_diff": 1.624507, "time_forward": 0.536425, "time_loss": 0.000410}
[03/28 05:35:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4140", "eta": "1:41:42", "loss": 0.095745, "lr": 0.016857, "mode": "train", "time_backward": 1.068975, "time_data": 0.020744, "time_diff": 1.502089, "time_forward": 0.401556, "time_loss": 0.000226}
[03/28 05:35:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4150", "eta": "1:41:11", "loss": 0.098062, "lr": 0.016874, "mode": "train", "time_backward": 1.056083, "time_data": 0.016674, "time_diff": 1.478368, "time_forward": 0.398225, "time_loss": 0.000211}
[03/28 05:36:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4160", "eta": "1:40:41", "loss": 0.120953, "lr": 0.016890, "mode": "train", "time_backward": 1.081494, "time_data": 0.018416, "time_diff": 1.689433, "time_forward": 0.445985, "time_loss": 0.000248}
[03/28 05:36:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4170", "eta": "1:40:10", "loss": 0.101976, "lr": 0.016906, "mode": "train", "time_backward": 1.088399, "time_data": 0.029759, "time_diff": 1.520115, "time_forward": 0.398494, "time_loss": 0.000243}
[03/28 05:37:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4180", "eta": "1:40:35", "loss": 0.091611, "lr": 0.016923, "mode": "train", "time_backward": 15.884133, "time_data": 0.016623, "time_diff": 16.306300, "time_forward": 0.398637, "time_loss": 0.000332}
[03/28 05:37:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4190", "eta": "1:40:04", "loss": 0.104872, "lr": 0.016939, "mode": "train", "time_backward": 1.054463, "time_data": 0.017752, "time_diff": 1.481574, "time_forward": 0.401016, "time_loss": 0.000328}
[03/28 05:37:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4200", "eta": "1:39:33", "loss": 0.093116, "lr": 0.016955, "mode": "train", "time_backward": 1.156349, "time_data": 0.033142, "time_diff": 1.630665, "time_forward": 0.432124, "time_loss": 0.000245}
[03/28 05:38:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4210", "eta": "1:38:47", "loss": 0.103066, "lr": 0.016971, "mode": "train", "time_backward": 1.063102, "time_data": 0.017154, "time_diff": 1.488529, "time_forward": 0.399951, "time_loss": 0.000339}
[03/28 05:38:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4220", "eta": "1:38:15", "loss": 0.095833, "lr": 0.016988, "mode": "train", "time_backward": 1.054398, "time_data": 0.017623, "time_diff": 1.525568, "time_forward": 0.444360, "time_loss": 0.000304}
[03/28 05:39:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4230", "eta": "1:37:44", "loss": 0.098545, "lr": 0.017004, "mode": "train", "time_backward": 1.137299, "time_data": 0.016931, "time_diff": 1.582011, "time_forward": 0.399234, "time_loss": 0.000396}
[03/28 05:39:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4240", "eta": "1:37:13", "loss": 0.099377, "lr": 0.017020, "mode": "train", "time_backward": 1.071428, "time_data": 0.017777, "time_diff": 1.495625, "time_forward": 0.399476, "time_loss": 0.000222}
[03/28 05:39:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4250", "eta": "1:36:42", "loss": 0.095801, "lr": 0.017037, "mode": "train", "time_backward": 1.100511, "time_data": 0.017383, "time_diff": 1.541985, "time_forward": 0.399638, "time_loss": 0.000329}
[03/28 05:39:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4260", "eta": "1:36:10", "loss": 0.093484, "lr": 0.017053, "mode": "train", "time_backward": 1.076952, "time_data": 0.018958, "time_diff": 1.516872, "time_forward": 0.412054, "time_loss": 0.000389}
[03/28 05:40:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4270", "eta": "1:35:40", "loss": 0.113521, "lr": 0.017069, "mode": "train", "time_backward": 1.077931, "time_data": 0.022265, "time_diff": 1.746371, "time_forward": 0.398658, "time_loss": 0.000382}
[03/28 05:40:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4280", "eta": "1:35:08", "loss": 0.109690, "lr": 0.017086, "mode": "train", "time_backward": 1.140789, "time_data": 0.017062, "time_diff": 1.565296, "time_forward": 0.398488, "time_loss": 0.000233}
[03/28 05:40:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4290", "eta": "1:34:38", "loss": 0.097841, "lr": 0.017102, "mode": "train", "time_backward": 1.145095, "time_data": 0.084902, "time_diff": 1.659952, "time_forward": 0.407131, "time_loss": 0.000352}
[03/28 05:41:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4300", "eta": "1:34:07", "loss": 0.103452, "lr": 0.017118, "mode": "train", "time_backward": 1.099876, "time_data": 0.019919, "time_diff": 1.544361, "time_forward": 0.399532, "time_loss": 0.000318}
[03/28 05:41:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4310", "eta": "1:33:36", "loss": 0.108477, "lr": 0.017135, "mode": "train", "time_backward": 1.068885, "time_data": 0.017296, "time_diff": 1.548102, "time_forward": 0.448914, "time_loss": 0.000228}
[03/28 05:41:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4320", "eta": "1:32:24", "loss": 0.109139, "lr": 0.017151, "mode": "train", "time_backward": 1.070092, "time_data": 0.017303, "time_diff": 1.493801, "time_forward": 0.399976, "time_loss": 0.000325}
[03/28 05:42:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4330", "eta": "1:31:57", "loss": 0.105500, "lr": 0.017167, "mode": "train", "time_backward": 1.608571, "time_data": 0.016722, "time_diff": 2.573465, "time_forward": 0.423704, "time_loss": 0.000335}
[03/28 05:42:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4340", "eta": "1:31:26", "loss": 0.092198, "lr": 0.017184, "mode": "train", "time_backward": 1.065804, "time_data": 0.017098, "time_diff": 1.496308, "time_forward": 0.403727, "time_loss": 0.000374}
[03/28 05:42:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4350", "eta": "1:30:55", "loss": 0.112956, "lr": 0.017200, "mode": "train", "time_backward": 1.087367, "time_data": 0.025187, "time_diff": 1.532252, "time_forward": 0.400806, "time_loss": 0.000335}
[03/28 05:43:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4360", "eta": "1:32:03", "loss": 0.088110, "lr": 0.017216, "mode": "train", "time_backward": 1.243883, "time_data": 27.285834, "time_diff": 29.922992, "time_forward": 1.336463, "time_loss": 0.004049}
[03/28 05:43:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4370", "eta": "1:31:32", "loss": 0.097614, "lr": 0.017233, "mode": "train", "time_backward": 1.083581, "time_data": 0.021210, "time_diff": 1.532934, "time_forward": 0.411141, "time_loss": 0.000245}
[03/28 05:43:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4380", "eta": "1:31:01", "loss": 0.105157, "lr": 0.017249, "mode": "train", "time_backward": 1.054736, "time_data": 0.057314, "time_diff": 1.603027, "time_forward": 0.456618, "time_loss": 0.022009}
[03/28 05:44:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4390", "eta": "1:30:30", "loss": 0.093135, "lr": 0.017265, "mode": "train", "time_backward": 1.168575, "time_data": 0.017737, "time_diff": 1.716956, "time_forward": 0.524354, "time_loss": 0.000331}
[03/28 05:44:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4400", "eta": "1:29:58", "loss": 0.092366, "lr": 0.017282, "mode": "train", "time_backward": 1.067645, "time_data": 0.021704, "time_diff": 1.518908, "time_forward": 0.418992, "time_loss": 0.000342}
[03/28 05:44:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4410", "eta": "1:29:27", "loss": 0.118105, "lr": 0.017298, "mode": "train", "time_backward": 1.170686, "time_data": 0.022074, "time_diff": 1.591953, "time_forward": 0.398507, "time_loss": 0.000254}
[03/28 05:44:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4420", "eta": "1:28:56", "loss": 0.096730, "lr": 0.017314, "mode": "train", "time_backward": 1.072056, "time_data": 0.016755, "time_diff": 1.492958, "time_forward": 0.398135, "time_loss": 0.000342}
[03/28 05:45:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4430", "eta": "1:28:24", "loss": 0.092183, "lr": 0.017331, "mode": "train", "time_backward": 1.053912, "time_data": 0.017172, "time_diff": 1.520345, "time_forward": 0.443477, "time_loss": 0.000293}
[03/28 05:45:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4440", "eta": "1:27:53", "loss": 0.106415, "lr": 0.017347, "mode": "train", "time_backward": 1.051603, "time_data": 0.017400, "time_diff": 1.496840, "time_forward": 0.424187, "time_loss": 0.000251}
[03/28 05:46:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4450", "eta": "1:27:22", "loss": 0.105600, "lr": 0.017363, "mode": "train", "time_backward": 1.100296, "time_data": 0.019609, "time_diff": 1.576307, "time_forward": 0.398427, "time_loss": 0.000239}
[03/28 05:46:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4460", "eta": "1:26:50", "loss": 0.086340, "lr": 0.017380, "mode": "train", "time_backward": 1.068051, "time_data": 0.021250, "time_diff": 1.498190, "time_forward": 0.400824, "time_loss": 0.000320}
[03/28 05:46:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4470", "eta": "1:26:19", "loss": 0.088722, "lr": 0.017396, "mode": "train", "time_backward": 1.157049, "time_data": 0.020546, "time_diff": 1.585716, "time_forward": 0.400782, "time_loss": 0.000585}
[03/28 05:47:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4480", "eta": "1:25:47", "loss": 0.098769, "lr": 0.017412, "mode": "train", "time_backward": 1.110456, "time_data": 0.026028, "time_diff": 1.542333, "time_forward": 0.398165, "time_loss": 0.000344}
[03/28 05:47:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4490", "eta": "1:25:15", "loss": 0.105114, "lr": 0.017429, "mode": "train", "time_backward": 1.064879, "time_data": 0.017296, "time_diff": 1.488056, "time_forward": 0.402159, "time_loss": 0.000335}
[03/28 05:47:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4500", "eta": "1:24:44", "loss": 0.092814, "lr": 0.017445, "mode": "train", "time_backward": 1.064631, "time_data": 0.016820, "time_diff": 1.543720, "time_forward": 0.398202, "time_loss": 0.000338}
[03/28 05:48:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4510", "eta": "1:24:23", "loss": 0.097472, "lr": 0.017461, "mode": "train", "time_backward": 1.769382, "time_data": 0.088967, "time_diff": 4.894695, "time_forward": 2.578474, "time_loss": 0.077363}
[03/28 05:48:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4520", "eta": "1:23:14", "loss": 0.103325, "lr": 0.017478, "mode": "train", "time_backward": 1.052351, "time_data": 0.166974, "time_diff": 1.629809, "time_forward": 0.399094, "time_loss": 0.000363}
[03/28 05:48:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4530", "eta": "1:22:43", "loss": 0.102135, "lr": 0.017494, "mode": "train", "time_backward": 1.055691, "time_data": 0.016936, "time_diff": 1.482376, "time_forward": 0.401632, "time_loss": 0.000261}
[03/28 05:49:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4540", "eta": "1:22:11", "loss": 0.112204, "lr": 0.017510, "mode": "train", "time_backward": 1.058023, "time_data": 0.017474, "time_diff": 1.481113, "time_forward": 0.403649, "time_loss": 0.000336}
[03/28 05:50:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4550", "eta": "1:21:40", "loss": 0.098522, "lr": 0.017527, "mode": "train", "time_backward": 1.102585, "time_data": 0.017338, "time_diff": 1.565488, "time_forward": 0.399506, "time_loss": 0.001479}
[03/28 05:50:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4560", "eta": "1:21:14", "loss": 0.095383, "lr": 0.017543, "mode": "train", "time_backward": 1.061221, "time_data": 1.957919, "time_diff": 3.427101, "time_forward": 0.400779, "time_loss": 0.000299}
[03/28 05:51:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4570", "eta": "1:20:43", "loss": 0.097692, "lr": 0.017559, "mode": "train", "time_backward": 1.058584, "time_data": 0.017327, "time_diff": 1.483702, "time_forward": 0.399581, "time_loss": 0.000345}
[03/28 05:51:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4580", "eta": "1:20:11", "loss": 0.091071, "lr": 0.017576, "mode": "train", "time_backward": 1.060758, "time_data": 0.018605, "time_diff": 1.482187, "time_forward": 0.399408, "time_loss": 0.000323}
[03/28 05:52:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4590", "eta": "1:19:39", "loss": 0.091819, "lr": 0.017592, "mode": "train", "time_backward": 1.077748, "time_data": 0.017561, "time_diff": 1.500763, "time_forward": 0.398455, "time_loss": 0.000327}
[03/28 05:52:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4600", "eta": "1:19:08", "loss": 0.102644, "lr": 0.017608, "mode": "train", "time_backward": 1.124910, "time_data": 0.024695, "time_diff": 1.663810, "time_forward": 0.398976, "time_loss": 0.000388}
[03/28 05:52:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4610", "eta": "1:18:36", "loss": 0.093218, "lr": 0.017625, "mode": "train", "time_backward": 1.062556, "time_data": 0.016636, "time_diff": 1.485310, "time_forward": 0.398004, "time_loss": 0.000223}
[03/28 05:52:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4620", "eta": "1:18:05", "loss": 0.096220, "lr": 0.017641, "mode": "train", "time_backward": 1.054544, "time_data": 0.017199, "time_diff": 1.477391, "time_forward": 0.401992, "time_loss": 0.000354}
[03/28 05:53:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4630", "eta": "1:17:33", "loss": 0.095495, "lr": 0.017657, "mode": "train", "time_backward": 1.155565, "time_data": 0.016861, "time_diff": 1.578322, "time_forward": 0.401940, "time_loss": 0.000264}
[03/28 05:53:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4640", "eta": "1:17:31", "loss": 0.102058, "lr": 0.017673, "mode": "train", "time_backward": 1.270210, "time_data": 9.350905, "time_diff": 11.387716, "time_forward": 0.765827, "time_loss": 0.000398}
[03/28 05:54:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4650", "eta": "1:16:59", "loss": 0.091105, "lr": 0.017690, "mode": "train", "time_backward": 1.056521, "time_data": 0.017496, "time_diff": 1.502091, "time_forward": 0.399388, "time_loss": 0.000324}
[03/28 05:54:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4660", "eta": "1:16:27", "loss": 0.104382, "lr": 0.017706, "mode": "train", "time_backward": 1.056978, "time_data": 0.016750, "time_diff": 1.481780, "time_forward": 0.399751, "time_loss": 0.000325}
[03/28 05:54:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4670", "eta": "1:15:56", "loss": 0.092826, "lr": 0.017722, "mode": "train", "time_backward": 1.098260, "time_data": 0.017771, "time_diff": 1.524161, "time_forward": 0.401529, "time_loss": 0.000484}
[03/28 05:55:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4680", "eta": "1:15:25", "loss": 0.081375, "lr": 0.017739, "mode": "train", "time_backward": 1.174436, "time_data": 0.017174, "time_diff": 1.595654, "time_forward": 0.403426, "time_loss": 0.000270}
[03/28 05:55:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4690", "eta": "1:14:53", "loss": 0.099557, "lr": 0.017755, "mode": "train", "time_backward": 1.134604, "time_data": 0.026696, "time_diff": 1.579932, "time_forward": 0.404209, "time_loss": 0.000410}
[03/28 05:55:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4700", "eta": "1:14:25", "loss": 0.099136, "lr": 0.017771, "mode": "train", "time_backward": 1.448674, "time_data": 0.016949, "time_diff": 2.932550, "time_forward": 1.406887, "time_loss": 0.052778}
[03/28 05:56:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4710", "eta": "1:13:53", "loss": 0.095605, "lr": 0.017788, "mode": "train", "time_backward": 1.059531, "time_data": 0.017659, "time_diff": 1.483306, "time_forward": 0.398263, "time_loss": 0.000245}
[03/28 05:57:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4720", "eta": "1:13:21", "loss": 0.104486, "lr": 0.017804, "mode": "train", "time_backward": 1.125027, "time_data": 0.017931, "time_diff": 1.545450, "time_forward": 0.398998, "time_loss": 0.000234}
[03/28 05:57:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4730", "eta": "1:12:49", "loss": 0.093782, "lr": 0.017820, "mode": "train", "time_backward": 1.061702, "time_data": 0.017274, "time_diff": 1.482122, "time_forward": 0.399752, "time_loss": 0.000316}
[03/28 05:58:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4740", "eta": "1:12:17", "loss": 0.086193, "lr": 0.017837, "mode": "train", "time_backward": 1.057557, "time_data": 0.016906, "time_diff": 1.481088, "time_forward": 0.399209, "time_loss": 0.000323}
[03/28 05:59:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4750", "eta": "1:13:00", "loss": 0.101340, "lr": 0.017853, "mode": "train", "time_backward": 29.172319, "time_data": 0.017098, "time_diff": 29.631361, "time_forward": 0.402508, "time_loss": 0.000543}
[03/28 06:00:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4760", "eta": "1:12:28", "loss": 0.098951, "lr": 0.017869, "mode": "train", "time_backward": 1.057759, "time_data": 0.017181, "time_diff": 1.538349, "time_forward": 0.400214, "time_loss": 0.000352}
[03/28 06:01:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4770", "eta": "1:11:55", "loss": 0.096422, "lr": 0.017886, "mode": "train", "time_backward": 1.055386, "time_data": 0.017186, "time_diff": 1.474655, "time_forward": 0.398519, "time_loss": 0.000220}
[03/28 06:01:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4780", "eta": "1:11:23", "loss": 0.093743, "lr": 0.017902, "mode": "train", "time_backward": 1.062526, "time_data": 0.017473, "time_diff": 1.490066, "time_forward": 0.406572, "time_loss": 0.000317}
[03/28 06:02:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4790", "eta": "1:10:51", "loss": 0.100875, "lr": 0.017918, "mode": "train", "time_backward": 1.064345, "time_data": 0.018720, "time_diff": 1.487559, "time_forward": 0.401969, "time_loss": 0.000380}
[03/28 06:02:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4800", "eta": "1:10:19", "loss": 0.114710, "lr": 0.017935, "mode": "train", "time_backward": 1.093147, "time_data": 0.022759, "time_diff": 1.527780, "time_forward": 0.399378, "time_loss": 0.000269}
[03/28 06:03:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4810", "eta": "1:09:45", "loss": 0.091237, "lr": 0.017951, "mode": "train", "time_backward": 1.116419, "time_data": 0.019849, "time_diff": 1.547258, "time_forward": 0.405119, "time_loss": 0.000323}
[03/28 06:03:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4820", "eta": "1:09:13", "loss": 0.095248, "lr": 0.017967, "mode": "train", "time_backward": 1.063291, "time_data": 0.017082, "time_diff": 1.486452, "time_forward": 0.398380, "time_loss": 0.000254}
[03/28 06:04:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4830", "eta": "1:08:41", "loss": 0.102250, "lr": 0.017984, "mode": "train", "time_backward": 1.059274, "time_data": 0.018636, "time_diff": 1.506707, "time_forward": 0.418926, "time_loss": 0.000477}
[03/28 06:04:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4840", "eta": "1:08:09", "loss": 0.096753, "lr": 0.018000, "mode": "train", "time_backward": 1.067202, "time_data": 0.016947, "time_diff": 1.487562, "time_forward": 0.400183, "time_loss": 0.000266}
[03/28 06:05:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4850", "eta": "1:08:15", "loss": 0.088679, "lr": 0.018016, "mode": "train", "time_backward": 1.066320, "time_data": 15.257789, "time_diff": 16.880913, "time_forward": 0.552023, "time_loss": 0.000772}
[03/28 06:06:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4860", "eta": "1:07:43", "loss": 0.094995, "lr": 0.018033, "mode": "train", "time_backward": 1.091935, "time_data": 0.017358, "time_diff": 1.601150, "time_forward": 0.487025, "time_loss": 0.000241}
[03/28 06:06:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4870", "eta": "1:07:11", "loss": 0.097759, "lr": 0.018049, "mode": "train", "time_backward": 1.439882, "time_data": 0.037499, "time_diff": 1.884406, "time_forward": 0.399311, "time_loss": 0.000296}
[03/28 06:06:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4880", "eta": "1:06:38", "loss": 0.106751, "lr": 0.018065, "mode": "train", "time_backward": 1.056604, "time_data": 0.021842, "time_diff": 1.486679, "time_forward": 0.400957, "time_loss": 0.000432}
[03/28 06:07:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4890", "eta": "1:06:06", "loss": 0.091478, "lr": 0.018082, "mode": "train", "time_backward": 1.153028, "time_data": 0.023245, "time_diff": 1.622768, "time_forward": 0.440649, "time_loss": 0.000422}
[03/28 06:07:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4900", "eta": "1:05:34", "loss": 0.101231, "lr": 0.018098, "mode": "train", "time_backward": 1.165877, "time_data": 0.017750, "time_diff": 1.591437, "time_forward": 0.398563, "time_loss": 0.000357}
[03/28 06:07:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4910", "eta": "1:05:01", "loss": 0.100525, "lr": 0.018114, "mode": "train", "time_backward": 1.058010, "time_data": 0.018191, "time_diff": 1.483434, "time_forward": 0.403181, "time_loss": 0.000710}
[03/28 06:08:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4920", "eta": "1:04:28", "loss": 0.091986, "lr": 0.018131, "mode": "train", "time_backward": 1.056879, "time_data": 0.016635, "time_diff": 1.474741, "time_forward": 0.397713, "time_loss": 0.000341}
[03/28 06:08:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4930", "eta": "1:04:05", "loss": 0.086130, "lr": 0.018147, "mode": "train", "time_backward": 1.208650, "time_data": 3.580342, "time_diff": 5.278213, "time_forward": 0.480749, "time_loss": 0.000630}
[03/28 06:09:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4940", "eta": "1:03:33", "loss": 0.098381, "lr": 0.018163, "mode": "train", "time_backward": 1.278066, "time_data": 0.024670, "time_diff": 1.727325, "time_forward": 0.412714, "time_loss": 0.000636}
[03/28 06:09:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4950", "eta": "1:03:00", "loss": 0.091238, "lr": 0.018180, "mode": "train", "time_backward": 1.111704, "time_data": 0.019392, "time_diff": 1.540156, "time_forward": 0.402458, "time_loss": 0.000439}
[03/28 06:09:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4960", "eta": "1:02:28", "loss": 0.100817, "lr": 0.018196, "mode": "train", "time_backward": 1.099782, "time_data": 0.019473, "time_diff": 1.526460, "time_forward": 0.400374, "time_loss": 0.000265}
[03/28 06:10:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4970", "eta": "1:01:56", "loss": 0.084884, "lr": 0.018212, "mode": "train", "time_backward": 1.084205, "time_data": 0.017646, "time_diff": 1.681091, "time_forward": 0.453335, "time_loss": 0.000351}
[03/28 06:10:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4980", "eta": "1:01:23", "loss": 0.098908, "lr": 0.018229, "mode": "train", "time_backward": 1.053828, "time_data": 0.022026, "time_diff": 1.480982, "time_forward": 0.400098, "time_loss": 0.000285}
[03/28 06:11:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "4990", "eta": "1:00:51", "loss": 0.091094, "lr": 0.018245, "mode": "train", "time_backward": 1.060786, "time_data": 0.017067, "time_diff": 1.481228, "time_forward": 0.399831, "time_loss": 0.000334}
[03/28 06:11:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5000", "eta": "1:01:00", "loss": 0.095394, "lr": 0.018261, "mode": "train", "time_backward": 20.194558, "time_data": 0.017641, "time_diff": 20.706969, "time_forward": 0.485798, "time_loss": 0.000374}
[03/28 06:12:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5010", "eta": "1:00:27", "loss": 0.088115, "lr": 0.018278, "mode": "train", "time_backward": 1.055615, "time_data": 0.017328, "time_diff": 1.519090, "time_forward": 0.442093, "time_loss": 0.000707}
[03/28 06:12:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5020", "eta": "1:00:07", "loss": 0.097817, "lr": 0.018294, "mode": "train", "time_backward": 6.854454, "time_data": 0.016869, "time_diff": 7.273467, "time_forward": 0.397194, "time_loss": 0.000204}
[03/28 06:13:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5030", "eta": "0:59:34", "loss": 0.108131, "lr": 0.018310, "mode": "train", "time_backward": 1.057840, "time_data": 0.018976, "time_diff": 1.484054, "time_forward": 0.400332, "time_loss": 0.000323}
[03/28 06:13:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5040", "eta": "0:59:01", "loss": 0.101553, "lr": 0.018327, "mode": "train", "time_backward": 1.065315, "time_data": 0.017131, "time_diff": 1.549473, "time_forward": 0.457311, "time_loss": 0.000223}
[03/28 06:13:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5050", "eta": "0:58:28", "loss": 0.095019, "lr": 0.018343, "mode": "train", "time_backward": 1.055179, "time_data": 0.019867, "time_diff": 1.504456, "time_forward": 0.401248, "time_loss": 0.000255}
[03/28 06:14:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5060", "eta": "0:57:55", "loss": 0.093956, "lr": 0.018359, "mode": "train", "time_backward": 1.068818, "time_data": 0.017331, "time_diff": 1.495578, "time_forward": 0.399449, "time_loss": 0.000403}
[03/28 06:15:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5070", "eta": "0:57:22", "loss": 0.097334, "lr": 0.018375, "mode": "train", "time_backward": 1.059178, "time_data": 0.017974, "time_diff": 1.483817, "time_forward": 0.399574, "time_loss": 0.000240}
[03/28 06:15:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5080", "eta": "0:56:49", "loss": 0.097973, "lr": 0.018392, "mode": "train", "time_backward": 1.068712, "time_data": 0.017974, "time_diff": 1.490892, "time_forward": 0.400579, "time_loss": 0.000416}
[03/28 06:16:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5090", "eta": "0:56:16", "loss": 0.105266, "lr": 0.018408, "mode": "train", "time_backward": 1.055796, "time_data": 0.017166, "time_diff": 1.523163, "time_forward": 0.446781, "time_loss": 0.000234}
[03/28 06:17:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5100", "eta": "0:56:11", "loss": 0.092722, "lr": 0.018424, "mode": "train", "time_backward": 15.120154, "time_data": 0.017089, "time_diff": 15.543410, "time_forward": 0.399512, "time_loss": 0.000309}
[03/28 06:17:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5110", "eta": "0:55:48", "loss": 0.102076, "lr": 0.018441, "mode": "train", "time_backward": 6.439513, "time_data": 0.017089, "time_diff": 6.861955, "time_forward": 0.398818, "time_loss": 0.000368}
[03/28 06:18:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5120", "eta": "0:52:45", "loss": 0.087591, "lr": 0.018457, "mode": "train", "time_backward": 1.058031, "time_data": 0.018970, "time_diff": 1.481482, "time_forward": 0.400704, "time_loss": 0.000388}
[03/28 06:18:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5130", "eta": "0:52:13", "loss": 0.097207, "lr": 0.018473, "mode": "train", "time_backward": 1.057181, "time_data": 0.017097, "time_diff": 1.484756, "time_forward": 0.400539, "time_loss": 0.000399}
[03/28 06:19:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5140", "eta": "0:51:41", "loss": 0.104217, "lr": 0.018490, "mode": "train", "time_backward": 1.083245, "time_data": 0.025026, "time_diff": 1.532419, "time_forward": 0.416859, "time_loss": 0.000245}
[03/28 06:20:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5150", "eta": "0:51:10", "loss": 0.095224, "lr": 0.018506, "mode": "train", "time_backward": 1.100157, "time_data": 0.016974, "time_diff": 1.523538, "time_forward": 0.399528, "time_loss": 0.000325}
[03/28 06:21:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5160", "eta": "0:50:41", "loss": 0.095046, "lr": 0.018522, "mode": "train", "time_backward": 19.100181, "time_data": 0.016891, "time_diff": 19.539762, "time_forward": 0.398919, "time_loss": 0.000361}
[03/28 06:21:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5170", "eta": "0:50:09", "loss": 0.090295, "lr": 0.018539, "mode": "train", "time_backward": 1.136964, "time_data": 0.027993, "time_diff": 1.622112, "time_forward": 0.438859, "time_loss": 0.017180}
[03/28 06:22:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5180", "eta": "0:49:37", "loss": 0.087829, "lr": 0.018555, "mode": "train", "time_backward": 1.098836, "time_data": 0.019501, "time_diff": 1.526792, "time_forward": 0.398848, "time_loss": 0.000290}
[03/28 06:22:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5190", "eta": "0:49:05", "loss": 0.091015, "lr": 0.018571, "mode": "train", "time_backward": 1.058944, "time_data": 0.017086, "time_diff": 1.482700, "time_forward": 0.399316, "time_loss": 0.000255}
[03/28 06:23:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5200", "eta": "0:48:12", "loss": 0.091465, "lr": 0.018588, "mode": "train", "time_backward": 1.104630, "time_data": 0.018191, "time_diff": 1.529393, "time_forward": 0.398208, "time_loss": 0.000284}
[03/28 06:23:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5210", "eta": "0:47:13", "loss": 0.100274, "lr": 0.018604, "mode": "train", "time_backward": 1.106761, "time_data": 0.018488, "time_diff": 1.530160, "time_forward": 0.399122, "time_loss": 0.000334}
[03/28 06:24:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5220", "eta": "0:46:42", "loss": 0.096838, "lr": 0.018620, "mode": "train", "time_backward": 1.094560, "time_data": 0.019798, "time_diff": 1.519675, "time_forward": 0.400987, "time_loss": 0.000439}
[03/28 06:25:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5230", "eta": "0:46:11", "loss": 0.092999, "lr": 0.018637, "mode": "train", "time_backward": 1.064206, "time_data": 0.017053, "time_diff": 1.521573, "time_forward": 0.437347, "time_loss": 0.000363}
[03/28 06:25:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5240", "eta": "0:45:39", "loss": 0.098309, "lr": 0.018653, "mode": "train", "time_backward": 1.097223, "time_data": 0.022230, "time_diff": 1.575984, "time_forward": 0.451439, "time_loss": 0.000703}
[03/28 06:25:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5250", "eta": "0:45:08", "loss": 0.094995, "lr": 0.018669, "mode": "train", "time_backward": 1.095421, "time_data": 0.029684, "time_diff": 1.535852, "time_forward": 0.398813, "time_loss": 0.000270}
[03/28 06:26:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5260", "eta": "0:44:37", "loss": 0.091194, "lr": 0.018686, "mode": "train", "time_backward": 1.067671, "time_data": 0.017104, "time_diff": 1.506348, "time_forward": 0.415384, "time_loss": 0.000328}
[03/28 06:26:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5270", "eta": "0:44:05", "loss": 0.103170, "lr": 0.018702, "mode": "train", "time_backward": 1.055254, "time_data": 0.022083, "time_diff": 1.560571, "time_forward": 0.470675, "time_loss": 0.000368}
[03/28 06:26:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5280", "eta": "0:43:34", "loss": 0.099980, "lr": 0.018718, "mode": "train", "time_backward": 1.061056, "time_data": 0.024286, "time_diff": 1.488913, "time_forward": 0.400606, "time_loss": 0.000383}
[03/28 06:27:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5290", "eta": "0:43:02", "loss": 0.098483, "lr": 0.018735, "mode": "train", "time_backward": 1.054977, "time_data": 0.016872, "time_diff": 1.542309, "time_forward": 0.452717, "time_loss": 0.000367}
[03/28 06:27:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5300", "eta": "0:42:31", "loss": 0.103827, "lr": 0.018751, "mode": "train", "time_backward": 1.117039, "time_data": 0.017648, "time_diff": 1.545529, "time_forward": 0.401026, "time_loss": 0.000332}
[03/28 06:27:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5310", "eta": "0:42:00", "loss": 0.102524, "lr": 0.018767, "mode": "train", "time_backward": 1.075292, "time_data": 0.019267, "time_diff": 1.499158, "time_forward": 0.400960, "time_loss": 0.000262}
[03/28 06:27:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5320", "eta": "0:41:28", "loss": 0.106060, "lr": 0.018784, "mode": "train", "time_backward": 1.063910, "time_data": 0.016843, "time_diff": 1.485769, "time_forward": 0.398070, "time_loss": 0.000291}
[03/28 06:28:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5330", "eta": "0:41:25", "loss": 0.099119, "lr": 0.018800, "mode": "train", "time_backward": 1.057940, "time_data": 21.706498, "time_diff": 23.185359, "time_forward": 0.417790, "time_loss": 0.000362}
[03/28 06:29:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5340", "eta": "0:40:53", "loss": 0.100909, "lr": 0.018816, "mode": "train", "time_backward": 1.060631, "time_data": 0.016977, "time_diff": 1.526533, "time_forward": 0.424975, "time_loss": 0.000658}
[03/28 06:29:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5350", "eta": "0:40:22", "loss": 0.106387, "lr": 0.018833, "mode": "train", "time_backward": 1.108081, "time_data": 0.018538, "time_diff": 1.564244, "time_forward": 0.407523, "time_loss": 0.000293}
[03/28 06:29:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5360", "eta": "0:39:50", "loss": 0.086948, "lr": 0.018849, "mode": "train", "time_backward": 1.097414, "time_data": 0.018403, "time_diff": 1.645059, "time_forward": 0.525778, "time_loss": 0.000361}
[03/28 06:29:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5370", "eta": "0:39:15", "loss": 0.108010, "lr": 0.018865, "mode": "train", "time_backward": 1.091666, "time_data": 0.021113, "time_diff": 1.533681, "time_forward": 0.399325, "time_loss": 0.000241}
[03/28 06:30:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5380", "eta": "0:38:44", "loss": 0.096767, "lr": 0.018882, "mode": "train", "time_backward": 1.122520, "time_data": 0.022705, "time_diff": 1.556796, "time_forward": 0.402350, "time_loss": 0.000374}
[03/28 06:30:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5390", "eta": "0:38:12", "loss": 0.104919, "lr": 0.018898, "mode": "train", "time_backward": 1.067300, "time_data": 0.028022, "time_diff": 1.518633, "time_forward": 0.419700, "time_loss": 0.000322}
[03/28 06:30:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5400", "eta": "0:37:40", "loss": 0.090571, "lr": 0.018914, "mode": "train", "time_backward": 1.057948, "time_data": 0.020607, "time_diff": 1.485117, "time_forward": 0.402833, "time_loss": 0.000432}
[03/28 06:31:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5410", "eta": "0:37:08", "loss": 0.093174, "lr": 0.018931, "mode": "train", "time_backward": 1.060367, "time_data": 0.017061, "time_diff": 1.489326, "time_forward": 0.408429, "time_loss": 0.000358}
[03/28 06:31:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5420", "eta": "0:36:36", "loss": 0.092953, "lr": 0.018947, "mode": "train", "time_backward": 1.058325, "time_data": 0.019551, "time_diff": 1.503612, "time_forward": 0.399356, "time_loss": 0.000360}
[03/28 06:32:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5430", "eta": "0:36:04", "loss": 0.101952, "lr": 0.018963, "mode": "train", "time_backward": 1.057375, "time_data": 0.016817, "time_diff": 1.484585, "time_forward": 0.398791, "time_loss": 0.000254}
[03/28 06:32:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5440", "eta": "0:34:16", "loss": 0.094047, "lr": 0.018980, "mode": "train", "time_backward": 1.054838, "time_data": 0.017574, "time_diff": 1.487309, "time_forward": 0.404954, "time_loss": 0.000309}
[03/28 06:33:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5450", "eta": "0:33:46", "loss": 0.094983, "lr": 0.018996, "mode": "train", "time_backward": 1.054151, "time_data": 0.016945, "time_diff": 1.492501, "time_forward": 0.414194, "time_loss": 0.000236}
[03/28 06:33:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5460", "eta": "0:33:15", "loss": 0.099711, "lr": 0.019012, "mode": "train", "time_backward": 1.117237, "time_data": 0.016958, "time_diff": 1.564059, "time_forward": 0.426354, "time_loss": 0.000346}
[03/28 06:33:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5470", "eta": "0:32:44", "loss": 0.078735, "lr": 0.019029, "mode": "train", "time_backward": 1.208723, "time_data": 0.023081, "time_diff": 1.753677, "time_forward": 0.473136, "time_loss": 0.000315}
[03/28 06:34:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5480", "eta": "0:32:14", "loss": 0.091606, "lr": 0.019045, "mode": "train", "time_backward": 1.074700, "time_data": 0.035601, "time_diff": 1.512916, "time_forward": 0.398378, "time_loss": 0.000235}
[03/28 06:34:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5490", "eta": "0:31:43", "loss": 0.093056, "lr": 0.019061, "mode": "train", "time_backward": 1.055037, "time_data": 0.016550, "time_diff": 1.477023, "time_forward": 0.397826, "time_loss": 0.000248}
[03/28 06:34:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5500", "eta": "0:31:12", "loss": 0.094500, "lr": 0.019078, "mode": "train", "time_backward": 1.113747, "time_data": 0.016969, "time_diff": 1.543006, "time_forward": 0.399455, "time_loss": 0.000317}
[03/28 06:35:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5510", "eta": "0:30:42", "loss": 0.110748, "lr": 0.019094, "mode": "train", "time_backward": 1.138182, "time_data": 0.016940, "time_diff": 1.605269, "time_forward": 0.398846, "time_loss": 0.000248}
[03/28 06:35:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5520", "eta": "0:30:11", "loss": 0.090141, "lr": 0.019110, "mode": "train", "time_backward": 1.069095, "time_data": 0.049645, "time_diff": 1.536307, "time_forward": 0.410465, "time_loss": 0.000260}
[03/28 06:36:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5530", "eta": "0:29:41", "loss": 0.089351, "lr": 0.019126, "mode": "train", "time_backward": 1.094950, "time_data": 0.017013, "time_diff": 1.522974, "time_forward": 0.402505, "time_loss": 0.000674}
[03/28 06:36:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5540", "eta": "0:29:10", "loss": 0.097475, "lr": 0.019143, "mode": "train", "time_backward": 1.075732, "time_data": 0.037424, "time_diff": 1.601729, "time_forward": 0.424764, "time_loss": 0.000243}
[03/28 06:36:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5550", "eta": "0:28:38", "loss": 0.103035, "lr": 0.019159, "mode": "train", "time_backward": 1.059753, "time_data": 0.016852, "time_diff": 1.496301, "time_forward": 0.399075, "time_loss": 0.000601}
[03/28 06:37:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5560", "eta": "0:28:07", "loss": 0.102542, "lr": 0.019175, "mode": "train", "time_backward": 1.112240, "time_data": 0.016594, "time_diff": 1.534578, "time_forward": 0.402220, "time_loss": 0.000254}
[03/28 06:37:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5570", "eta": "0:27:37", "loss": 0.088802, "lr": 0.019192, "mode": "train", "time_backward": 1.073387, "time_data": 0.018910, "time_diff": 1.513985, "time_forward": 0.413279, "time_loss": 0.000287}
[03/28 06:37:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5580", "eta": "0:27:06", "loss": 0.087457, "lr": 0.019208, "mode": "train", "time_backward": 1.156008, "time_data": 0.020401, "time_diff": 1.577741, "time_forward": 0.400587, "time_loss": 0.000364}
[03/28 06:38:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5590", "eta": "0:26:35", "loss": 0.088418, "lr": 0.019224, "mode": "train", "time_backward": 1.100044, "time_data": 0.017067, "time_diff": 1.523222, "time_forward": 0.399232, "time_loss": 0.000313}
[03/28 06:38:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5600", "eta": "0:26:05", "loss": 0.107664, "lr": 0.019241, "mode": "train", "time_backward": 1.058828, "time_data": 0.023529, "time_diff": 1.533786, "time_forward": 0.447280, "time_loss": 0.000725}
[03/28 06:38:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5610", "eta": "0:25:34", "loss": 0.093656, "lr": 0.019257, "mode": "train", "time_backward": 1.072179, "time_data": 0.017824, "time_diff": 1.507708, "time_forward": 0.405522, "time_loss": 0.000331}
[03/28 06:39:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5620", "eta": "0:25:03", "loss": 0.094336, "lr": 0.019273, "mode": "train", "time_backward": 1.078240, "time_data": 0.017637, "time_diff": 1.499252, "time_forward": 0.399794, "time_loss": 0.000231}
[03/28 06:39:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5630", "eta": "0:24:32", "loss": 0.100348, "lr": 0.019290, "mode": "train", "time_backward": 1.059035, "time_data": 0.016866, "time_diff": 1.497522, "time_forward": 0.401392, "time_loss": 0.000264}
[03/28 06:39:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5640", "eta": "0:24:02", "loss": 0.105838, "lr": 0.019306, "mode": "train", "time_backward": 1.103755, "time_data": 0.018334, "time_diff": 1.542211, "time_forward": 0.398932, "time_loss": 0.000259}
[03/28 06:40:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5650", "eta": "0:23:31", "loss": 0.102720, "lr": 0.019322, "mode": "train", "time_backward": 1.057647, "time_data": 0.019190, "time_diff": 1.481304, "time_forward": 0.400597, "time_loss": 0.000379}
[03/28 06:40:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5660", "eta": "0:23:01", "loss": 0.095695, "lr": 0.019339, "mode": "train", "time_backward": 1.166029, "time_data": 0.017824, "time_diff": 1.659547, "time_forward": 0.438781, "time_loss": 0.000353}
[03/28 06:40:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5670", "eta": "0:22:30", "loss": 0.099267, "lr": 0.019355, "mode": "train", "time_backward": 1.056721, "time_data": 0.018640, "time_diff": 1.480214, "time_forward": 0.401273, "time_loss": 0.000361}
[03/28 06:41:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5680", "eta": "0:21:59", "loss": 0.103609, "lr": 0.019371, "mode": "train", "time_backward": 1.065383, "time_data": 0.017031, "time_diff": 1.484698, "time_forward": 0.399522, "time_loss": 0.000425}
[03/28 06:41:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5690", "eta": "0:21:29", "loss": 0.076380, "lr": 0.019388, "mode": "train", "time_backward": 1.056862, "time_data": 0.017218, "time_diff": 1.480318, "time_forward": 0.399441, "time_loss": 0.000235}
[03/28 06:43:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5700", "eta": "0:20:58", "loss": 0.097854, "lr": 0.019404, "mode": "train", "time_backward": 1.135779, "time_data": 0.017219, "time_diff": 1.567638, "time_forward": 0.400117, "time_loss": 0.000351}
[03/28 06:43:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5710", "eta": "0:20:27", "loss": 0.095241, "lr": 0.019420, "mode": "train", "time_backward": 1.128833, "time_data": 0.017763, "time_diff": 1.558524, "time_forward": 0.400807, "time_loss": 0.000276}
[03/28 06:44:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5720", "eta": "0:20:30", "loss": 0.094155, "lr": 0.019437, "mode": "train", "time_backward": 1.179274, "time_data": 48.033507, "time_diff": 49.984368, "time_forward": 0.699733, "time_loss": 0.000484}
[03/28 06:45:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5730", "eta": "0:19:59", "loss": 0.106219, "lr": 0.019453, "mode": "train", "time_backward": 1.055270, "time_data": 0.017293, "time_diff": 1.556145, "time_forward": 0.399494, "time_loss": 0.000299}
[03/28 06:45:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5740", "eta": "0:19:27", "loss": 0.091718, "lr": 0.019469, "mode": "train", "time_backward": 1.058623, "time_data": 0.018065, "time_diff": 1.482967, "time_forward": 0.402495, "time_loss": 0.000614}
[03/28 06:45:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5750", "eta": "0:18:56", "loss": 0.100495, "lr": 0.019486, "mode": "train", "time_backward": 1.055757, "time_data": 0.018181, "time_diff": 1.479440, "time_forward": 0.399445, "time_loss": 0.000253}
[03/28 06:46:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5760", "eta": "0:18:24", "loss": 0.097241, "lr": 0.019502, "mode": "train", "time_backward": 1.057652, "time_data": 0.016923, "time_diff": 1.482106, "time_forward": 0.400046, "time_loss": 0.000268}
[03/28 06:47:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5770", "eta": "0:17:53", "loss": 0.083414, "lr": 0.019518, "mode": "train", "time_backward": 1.086235, "time_data": 0.016902, "time_diff": 1.520512, "time_forward": 0.413910, "time_loss": 0.000316}
[03/28 06:47:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5780", "eta": "0:17:18", "loss": 0.094827, "lr": 0.019535, "mode": "train", "time_backward": 1.116825, "time_data": 0.019562, "time_diff": 1.592190, "time_forward": 0.431524, "time_loss": 0.002316}
[03/28 06:47:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5790", "eta": "0:16:36", "loss": 0.092588, "lr": 0.019551, "mode": "train", "time_backward": 1.064438, "time_data": 0.017068, "time_diff": 1.488279, "time_forward": 0.400528, "time_loss": 0.000411}
[03/28 06:48:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5800", "eta": "0:16:08", "loss": 0.104608, "lr": 0.019567, "mode": "train", "time_backward": 1.657834, "time_data": 10.237349, "time_diff": 13.079185, "time_forward": 1.059653, "time_loss": 0.107600}
[03/28 06:48:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5810", "eta": "0:15:37", "loss": 0.092873, "lr": 0.019584, "mode": "train", "time_backward": 1.060457, "time_data": 0.017133, "time_diff": 1.480676, "time_forward": 0.399549, "time_loss": 0.000335}
[03/28 06:49:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5820", "eta": "0:15:06", "loss": 0.090586, "lr": 0.019600, "mode": "train", "time_backward": 1.056305, "time_data": 0.017317, "time_diff": 1.484152, "time_forward": 0.399966, "time_loss": 0.000257}
[03/28 06:49:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5830", "eta": "0:14:35", "loss": 0.093684, "lr": 0.019616, "mode": "train", "time_backward": 1.056468, "time_data": 0.017284, "time_diff": 1.521505, "time_forward": 0.444060, "time_loss": 0.000425}
[03/28 06:50:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5840", "eta": "0:14:03", "loss": 0.090913, "lr": 0.019633, "mode": "train", "time_backward": 1.054920, "time_data": 0.016575, "time_diff": 1.477403, "time_forward": 0.397729, "time_loss": 0.000210}
[03/28 06:50:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5850", "eta": "0:13:32", "loss": 0.090625, "lr": 0.019649, "mode": "train", "time_backward": 1.071517, "time_data": 0.017120, "time_diff": 1.587784, "time_forward": 0.455334, "time_loss": 0.001592}
[03/28 06:50:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5860", "eta": "0:13:01", "loss": 0.098092, "lr": 0.019665, "mode": "train", "time_backward": 1.116878, "time_data": 0.017052, "time_diff": 1.542874, "time_forward": 0.399233, "time_loss": 0.000263}
[03/28 06:51:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5870", "eta": "0:12:26", "loss": 0.085568, "lr": 0.019682, "mode": "train", "time_backward": 1.073406, "time_data": 0.017126, "time_diff": 1.508666, "time_forward": 0.405295, "time_loss": 0.000368}
[03/28 06:51:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5880", "eta": "0:11:56", "loss": 0.095753, "lr": 0.019698, "mode": "train", "time_backward": 1.064737, "time_data": 1.534188, "time_diff": 3.006987, "time_forward": 0.400816, "time_loss": 0.000400}
[03/28 06:51:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5890", "eta": "0:11:24", "loss": 0.094142, "lr": 0.019714, "mode": "train", "time_backward": 1.056031, "time_data": 0.018254, "time_diff": 1.480078, "time_forward": 0.400986, "time_loss": 0.000236}
[03/28 06:52:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5900", "eta": "0:10:53", "loss": 0.100940, "lr": 0.019731, "mode": "train", "time_backward": 1.057633, "time_data": 0.016871, "time_diff": 1.506093, "time_forward": 0.399048, "time_loss": 0.000242}
[03/28 06:53:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5910", "eta": "0:10:22", "loss": 0.093913, "lr": 0.019747, "mode": "train", "time_backward": 1.073324, "time_data": 0.017324, "time_diff": 1.506212, "time_forward": 0.400147, "time_loss": 0.000270}
[03/28 06:53:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5920", "eta": "0:09:48", "loss": 0.091762, "lr": 0.019763, "mode": "train", "time_backward": 1.083400, "time_data": 0.017026, "time_diff": 1.502427, "time_forward": 0.398494, "time_loss": 0.000253}
[03/28 06:54:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5930", "eta": "0:09:17", "loss": 0.096815, "lr": 0.019780, "mode": "train", "time_backward": 1.059013, "time_data": 0.017253, "time_diff": 1.479868, "time_forward": 0.400176, "time_loss": 0.000241}
[03/28 06:54:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5940", "eta": "0:08:46", "loss": 0.088625, "lr": 0.019796, "mode": "train", "time_backward": 1.059871, "time_data": 0.016828, "time_diff": 1.481637, "time_forward": 0.399487, "time_loss": 0.000337}
[03/28 06:55:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5950", "eta": "0:08:12", "loss": 0.091883, "lr": 0.019812, "mode": "train", "time_backward": 1.058135, "time_data": 0.018278, "time_diff": 1.479165, "time_forward": 0.398983, "time_loss": 0.000348}
[03/28 06:56:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5960", "eta": "0:07:58", "loss": 0.094112, "lr": 0.019828, "mode": "train", "time_backward": 1.168328, "time_data": 55.679465, "time_diff": 57.753955, "time_forward": 0.867706, "time_loss": 0.008939}
[03/28 06:56:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5970", "eta": "0:07:27", "loss": 0.096432, "lr": 0.019845, "mode": "train", "time_backward": 1.056835, "time_data": 0.017371, "time_diff": 1.480254, "time_forward": 0.402510, "time_loss": 0.000252}
[03/28 06:57:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5980", "eta": "0:06:55", "loss": 0.093323, "lr": 0.019861, "mode": "train", "time_backward": 2.061516, "time_data": 0.022520, "time_diff": 2.527715, "time_forward": 0.440273, "time_loss": 0.000353}
[03/28 06:57:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "5990", "eta": "0:06:21", "loss": 0.096680, "lr": 0.019877, "mode": "train", "time_backward": 1.069558, "time_data": 0.017349, "time_diff": 1.492243, "time_forward": 0.399743, "time_loss": 0.000751}
[03/28 06:58:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6000", "eta": "0:05:49", "loss": 0.087529, "lr": 0.019894, "mode": "train", "time_backward": 1.087118, "time_data": 0.028572, "time_diff": 1.649790, "time_forward": 0.526863, "time_loss": 0.000265}
[03/28 06:59:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6010", "eta": "0:05:18", "loss": 0.097497, "lr": 0.019910, "mode": "train", "time_backward": 1.064009, "time_data": 0.017295, "time_diff": 1.481355, "time_forward": 0.399042, "time_loss": 0.000421}
[03/28 06:59:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6020", "eta": "0:04:46", "loss": 0.098298, "lr": 0.019926, "mode": "train", "time_backward": 1.100574, "time_data": 0.017848, "time_diff": 1.521756, "time_forward": 0.400000, "time_loss": 0.000426}
[03/28 07:00:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6030", "eta": "0:04:15", "loss": 0.096333, "lr": 0.019943, "mode": "train", "time_backward": 1.055457, "time_data": 0.018390, "time_diff": 1.481427, "time_forward": 0.399222, "time_loss": 0.000266}
[03/28 07:01:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6040", "eta": "0:03:47", "loss": 0.084232, "lr": 0.019959, "mode": "train", "time_backward": 1.059554, "time_data": 28.580201, "time_diff": 30.086862, "time_forward": 0.415149, "time_loss": 0.000485}
[03/28 07:01:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6050", "eta": "0:03:14", "loss": 0.087848, "lr": 0.019975, "mode": "train", "time_backward": 1.064947, "time_data": 0.016923, "time_diff": 1.488329, "time_forward": 0.403673, "time_loss": 0.000314}
[03/28 07:02:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6060", "eta": "0:02:42", "loss": 0.093438, "lr": 0.019992, "mode": "train", "time_backward": 1.055165, "time_data": 0.018546, "time_diff": 1.528056, "time_forward": 0.450580, "time_loss": 0.000481}
[03/28 07:02:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6070", "eta": "0:02:11", "loss": 0.099585, "lr": 0.020008, "mode": "train", "time_backward": 1.064002, "time_data": 0.016822, "time_diff": 1.482700, "time_forward": 0.399684, "time_loss": 0.000332}
[03/28 07:03:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6080", "eta": "0:01:39", "loss": 0.092680, "lr": 0.020024, "mode": "train", "time_backward": 1.074478, "time_data": 0.033397, "time_diff": 1.713373, "time_forward": 0.598674, "time_loss": 0.000259}
[03/28 07:03:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6090", "eta": "0:01:07", "loss": 0.087654, "lr": 0.020041, "mode": "train", "time_backward": 3.726689, "time_data": 0.017040, "time_diff": 4.148294, "time_forward": 0.398332, "time_loss": 0.000238}
[03/28 07:03:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6100", "eta": "0:00:35", "loss": 0.096394, "lr": 0.020057, "mode": "train", "time_backward": 1.055650, "time_data": 0.016853, "time_diff": 1.476525, "time_forward": 0.399009, "time_loss": 0.000259}
[03/28 07:05:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "2", "cur_iter": "6110", "eta": "0:00:03", "loss": 0.096031, "lr": 0.020073, "mode": "train", "time_backward": 1.055107, "time_data": 0.017093, "time_diff": 1.473318, "time_forward": 0.397713, "time_loss": 0.000202}
[03/28 08:15:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "10", "eta": "5:25:31", "loss": 0.088245, "lr": 0.020090, "mode": "train", "time_backward": 1.058720, "time_data": 0.017308, "time_diff": 1.482401, "time_forward": 0.398824, "time_loss": 0.000263}
[03/28 08:16:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "20", "eta": "5:12:01", "loss": 0.095343, "lr": 0.020106, "mode": "train", "time_backward": 1.099535, "time_data": 0.017079, "time_diff": 1.572367, "time_forward": 0.436709, "time_loss": 0.000539}
[03/28 08:16:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "30", "eta": "5:11:30", "loss": 0.099067, "lr": 0.020122, "mode": "train", "time_backward": 1.055915, "time_data": 0.017019, "time_diff": 1.479389, "time_forward": 0.401197, "time_loss": 0.000382}
[03/28 08:17:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "40", "eta": "5:11:00", "loss": 0.093708, "lr": 0.020139, "mode": "train", "time_backward": 1.062417, "time_data": 0.017094, "time_diff": 1.486478, "time_forward": 0.399681, "time_loss": 0.000286}
[03/28 08:17:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "50", "eta": "5:10:29", "loss": 0.086447, "lr": 0.020155, "mode": "train", "time_backward": 1.101332, "time_data": 0.017339, "time_diff": 1.528431, "time_forward": 0.402024, "time_loss": 0.000515}
[03/28 08:18:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "60", "eta": "5:09:59", "loss": 0.086376, "lr": 0.020171, "mode": "train", "time_backward": 1.090511, "time_data": 0.035100, "time_diff": 1.576213, "time_forward": 0.409905, "time_loss": 0.000271}
[03/28 08:18:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "70", "eta": "5:09:27", "loss": 0.097915, "lr": 0.020188, "mode": "train", "time_backward": 1.098131, "time_data": 0.017466, "time_diff": 1.544425, "time_forward": 0.417252, "time_loss": 0.000260}
[03/28 08:18:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "80", "eta": "5:08:56", "loss": 0.086576, "lr": 0.020204, "mode": "train", "time_backward": 1.079073, "time_data": 0.018955, "time_diff": 1.522204, "time_forward": 0.399641, "time_loss": 0.000243}
[03/28 08:19:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "90", "eta": "5:08:25", "loss": 0.092661, "lr": 0.020220, "mode": "train", "time_backward": 1.069900, "time_data": 0.018517, "time_diff": 1.504988, "time_forward": 0.409030, "time_loss": 0.000270}
[03/28 08:19:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "100", "eta": "5:07:54", "loss": 0.094346, "lr": 0.020237, "mode": "train", "time_backward": 1.082513, "time_data": 0.020157, "time_diff": 1.516894, "time_forward": 0.403650, "time_loss": 0.000367}
[03/28 08:19:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "110", "eta": "5:07:31", "loss": 0.101143, "lr": 0.020253, "mode": "train", "time_backward": 1.779208, "time_data": 0.016772, "time_diff": 2.203656, "time_forward": 0.399405, "time_loss": 0.000278}
[03/28 08:20:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "120", "eta": "5:07:00", "loss": 0.088987, "lr": 0.020269, "mode": "train", "time_backward": 1.105225, "time_data": 0.020133, "time_diff": 1.541279, "time_forward": 0.403286, "time_loss": 0.000279}
[03/28 08:20:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "130", "eta": "5:06:30", "loss": 0.092696, "lr": 0.020286, "mode": "train", "time_backward": 1.056162, "time_data": 0.017616, "time_diff": 1.481192, "time_forward": 0.403721, "time_loss": 0.000396}
[03/28 08:22:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "140", "eta": "5:06:30", "loss": 0.100219, "lr": 0.020302, "mode": "train", "time_backward": 3.929594, "time_data": 0.016587, "time_diff": 4.354867, "time_forward": 0.400856, "time_loss": 0.000318}
[03/28 08:22:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "150", "eta": "5:05:59", "loss": 0.090921, "lr": 0.020318, "mode": "train", "time_backward": 1.079217, "time_data": 0.017009, "time_diff": 1.501207, "time_forward": 0.398704, "time_loss": 0.000248}
[03/28 08:22:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "160", "eta": "5:05:30", "loss": 0.086049, "lr": 0.020335, "mode": "train", "time_backward": 1.156378, "time_data": 0.016744, "time_diff": 1.711452, "time_forward": 0.423848, "time_loss": 0.000275}
[03/28 08:22:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "170", "eta": "5:05:00", "loss": 0.088617, "lr": 0.020351, "mode": "train", "time_backward": 1.118892, "time_data": 0.017829, "time_diff": 1.549158, "time_forward": 0.401321, "time_loss": 0.000389}
[03/28 08:23:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "180", "eta": "5:04:03", "loss": 0.086632, "lr": 0.020367, "mode": "train", "time_backward": 1.056975, "time_data": 0.038294, "time_diff": 1.541623, "time_forward": 0.442366, "time_loss": 0.000649}
[03/28 08:24:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "190", "eta": "5:03:33", "loss": 0.097942, "lr": 0.020384, "mode": "train", "time_backward": 1.069771, "time_data": 0.016942, "time_diff": 1.509725, "time_forward": 0.403264, "time_loss": 0.000283}
[03/28 08:24:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "200", "eta": "5:03:02", "loss": 0.095913, "lr": 0.020400, "mode": "train", "time_backward": 1.059626, "time_data": 0.017293, "time_diff": 1.482930, "time_forward": 0.399128, "time_loss": 0.000192}
[03/28 08:25:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "210", "eta": "5:02:31", "loss": 0.082047, "lr": 0.020416, "mode": "train", "time_backward": 1.058990, "time_data": 0.016859, "time_diff": 1.481791, "time_forward": 0.399412, "time_loss": 0.000245}
[03/28 08:25:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "220", "eta": "5:02:46", "loss": 0.090529, "lr": 0.020433, "mode": "train", "time_backward": 5.033985, "time_data": 0.017005, "time_diff": 5.454414, "time_forward": 0.399849, "time_loss": 0.000264}
[03/28 08:26:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "230", "eta": "5:02:15", "loss": 0.097027, "lr": 0.020449, "mode": "train", "time_backward": 1.056369, "time_data": 0.018128, "time_diff": 1.479407, "time_forward": 0.400053, "time_loss": 0.000611}
[03/28 08:26:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "240", "eta": "5:01:42", "loss": 0.095481, "lr": 0.020465, "mode": "train", "time_backward": 1.053981, "time_data": 0.016946, "time_diff": 1.487911, "time_forward": 0.399299, "time_loss": 0.000229}
[03/28 08:26:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "250", "eta": "5:01:52", "loss": 0.084381, "lr": 0.020482, "mode": "train", "time_backward": 4.619122, "time_data": 0.018726, "time_diff": 5.040634, "time_forward": 0.399357, "time_loss": 0.000322}
[03/28 08:27:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "260", "eta": "5:03:16", "loss": 0.098172, "lr": 0.020498, "mode": "train", "time_backward": 10.948883, "time_data": 0.016962, "time_diff": 11.370299, "time_forward": 0.398644, "time_loss": 0.000279}
[03/28 08:27:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "270", "eta": "5:02:45", "loss": 0.104480, "lr": 0.020514, "mode": "train", "time_backward": 1.057259, "time_data": 0.017389, "time_diff": 1.481149, "time_forward": 0.399403, "time_loss": 0.000239}
[03/28 08:28:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "280", "eta": "5:02:14", "loss": 0.099025, "lr": 0.020530, "mode": "train", "time_backward": 1.054954, "time_data": 0.016661, "time_diff": 1.477807, "time_forward": 0.397950, "time_loss": 0.000234}
[03/28 08:28:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "290", "eta": "5:01:43", "loss": 0.087458, "lr": 0.020547, "mode": "train", "time_backward": 1.081321, "time_data": 0.017965, "time_diff": 1.508678, "time_forward": 0.405715, "time_loss": 0.000404}
[03/28 08:29:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "300", "eta": "5:01:11", "loss": 0.098621, "lr": 0.020563, "mode": "train", "time_backward": 1.098857, "time_data": 0.017491, "time_diff": 1.519089, "time_forward": 0.398989, "time_loss": 0.000289}
[03/28 08:29:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "310", "eta": "5:00:40", "loss": 0.082088, "lr": 0.020579, "mode": "train", "time_backward": 1.074000, "time_data": 0.018572, "time_diff": 1.519737, "time_forward": 0.423695, "time_loss": 0.000333}
[03/28 08:29:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "320", "eta": "5:00:08", "loss": 0.088774, "lr": 0.020596, "mode": "train", "time_backward": 1.122221, "time_data": 0.016900, "time_diff": 1.541590, "time_forward": 0.398974, "time_loss": 0.000238}
[03/28 08:30:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "330", "eta": "4:59:38", "loss": 0.103134, "lr": 0.020612, "mode": "train", "time_backward": 1.116732, "time_data": 0.017125, "time_diff": 1.539672, "time_forward": 0.398403, "time_loss": 0.000249}
[03/28 08:30:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "340", "eta": "4:59:03", "loss": 0.089319, "lr": 0.020628, "mode": "train", "time_backward": 1.055688, "time_data": 0.017945, "time_diff": 1.481550, "time_forward": 0.401213, "time_loss": 0.000290}
[03/28 08:30:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "350", "eta": "4:58:32", "loss": 0.103484, "lr": 0.020645, "mode": "train", "time_backward": 1.056201, "time_data": 0.017185, "time_diff": 1.479549, "time_forward": 0.399280, "time_loss": 0.000408}
[03/28 08:31:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "360", "eta": "5:04:23", "loss": 0.096156, "lr": 0.020661, "mode": "train", "time_backward": 27.273004, "time_data": 6.019503, "time_diff": 34.986399, "time_forward": 1.622670, "time_loss": 0.022384}
[03/28 08:32:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "370", "eta": "5:03:52", "loss": 0.082359, "lr": 0.020677, "mode": "train", "time_backward": 1.112283, "time_data": 0.017256, "time_diff": 1.533884, "time_forward": 0.400729, "time_loss": 0.000457}
[03/28 08:33:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "380", "eta": "5:03:18", "loss": 0.087987, "lr": 0.020694, "mode": "train", "time_backward": 1.079640, "time_data": 0.018113, "time_diff": 1.539629, "time_forward": 0.398489, "time_loss": 0.000491}
[03/28 08:33:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "390", "eta": "5:02:46", "loss": 0.098523, "lr": 0.020710, "mode": "train", "time_backward": 1.068768, "time_data": 0.046440, "time_diff": 1.520860, "time_forward": 0.398544, "time_loss": 0.000273}
[03/28 08:33:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "400", "eta": "5:02:14", "loss": 0.088362, "lr": 0.020726, "mode": "train", "time_backward": 1.062773, "time_data": 0.017336, "time_diff": 1.520373, "time_forward": 0.399721, "time_loss": 0.000361}
[03/28 08:34:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "410", "eta": "5:01:42", "loss": 0.091725, "lr": 0.020743, "mode": "train", "time_backward": 1.092595, "time_data": 0.017526, "time_diff": 1.522099, "time_forward": 0.398144, "time_loss": 0.000245}
[03/28 08:34:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "420", "eta": "5:01:00", "loss": 0.085121, "lr": 0.020759, "mode": "train", "time_backward": 1.125772, "time_data": 0.017607, "time_diff": 1.557906, "time_forward": 0.409940, "time_loss": 0.000291}
[03/28 08:34:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "430", "eta": "5:00:28", "loss": 0.087903, "lr": 0.020775, "mode": "train", "time_backward": 1.052451, "time_data": 0.016596, "time_diff": 1.472842, "time_forward": 0.397453, "time_loss": 0.000232}
[03/28 08:35:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "440", "eta": "4:59:13", "loss": 0.083021, "lr": 0.020792, "mode": "train", "time_backward": 1.111273, "time_data": 0.017092, "time_diff": 1.540627, "time_forward": 0.399631, "time_loss": 0.000305}
[03/28 08:35:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "450", "eta": "4:58:41", "loss": 0.094523, "lr": 0.020808, "mode": "train", "time_backward": 1.057086, "time_data": 0.018046, "time_diff": 1.490772, "time_forward": 0.398556, "time_loss": 0.000234}
[03/28 08:36:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "460", "eta": "4:58:01", "loss": 0.083581, "lr": 0.020824, "mode": "train", "time_backward": 1.060955, "time_data": 0.017064, "time_diff": 1.505373, "time_forward": 0.399304, "time_loss": 0.000276}
[03/28 08:36:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "470", "eta": "4:57:29", "loss": 0.088205, "lr": 0.020841, "mode": "train", "time_backward": 1.062342, "time_data": 0.017693, "time_diff": 1.517400, "time_forward": 0.433721, "time_loss": 0.000372}
[03/28 08:36:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "480", "eta": "4:56:57", "loss": 0.086130, "lr": 0.020857, "mode": "train", "time_backward": 1.084364, "time_data": 0.017387, "time_diff": 1.520414, "time_forward": 0.398467, "time_loss": 0.000334}
[03/28 08:37:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "490", "eta": "4:56:26", "loss": 0.090602, "lr": 0.020873, "mode": "train", "time_backward": 1.062474, "time_data": 0.017054, "time_diff": 1.530198, "time_forward": 0.447335, "time_loss": 0.000268}
[03/28 08:37:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "500", "eta": "4:55:55", "loss": 0.094960, "lr": 0.020890, "mode": "train", "time_backward": 1.074544, "time_data": 0.016855, "time_diff": 1.500887, "time_forward": 0.401547, "time_loss": 0.000314}
[03/28 08:38:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "510", "eta": "4:55:22", "loss": 0.085866, "lr": 0.020906, "mode": "train", "time_backward": 1.133281, "time_data": 0.017013, "time_diff": 1.554602, "time_forward": 0.401601, "time_loss": 0.000283}
[03/28 08:38:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "520", "eta": "4:54:50", "loss": 0.082230, "lr": 0.020922, "mode": "train", "time_backward": 1.142800, "time_data": 0.016994, "time_diff": 1.567184, "time_forward": 0.398827, "time_loss": 0.000233}
[03/28 08:38:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "530", "eta": "4:54:19", "loss": 0.096921, "lr": 0.020939, "mode": "train", "time_backward": 1.088164, "time_data": 0.017321, "time_diff": 1.520015, "time_forward": 0.406958, "time_loss": 0.000338}
[03/28 08:39:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "540", "eta": "4:53:46", "loss": 0.082206, "lr": 0.020955, "mode": "train", "time_backward": 1.054684, "time_data": 0.019062, "time_diff": 1.479657, "time_forward": 0.401527, "time_loss": 0.000320}
[03/28 08:39:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "550", "eta": "4:53:13", "loss": 0.100455, "lr": 0.020971, "mode": "train", "time_backward": 1.056883, "time_data": 0.017982, "time_diff": 1.487606, "time_forward": 0.401814, "time_loss": 0.000277}
[03/28 08:40:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "560", "eta": "4:52:41", "loss": 0.089774, "lr": 0.020988, "mode": "train", "time_backward": 1.061361, "time_data": 0.017074, "time_diff": 1.496548, "time_forward": 0.399781, "time_loss": 0.000281}
[03/28 08:41:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "570", "eta": "4:52:10", "loss": 0.094325, "lr": 0.021004, "mode": "train", "time_backward": 1.061748, "time_data": 0.017318, "time_diff": 1.497651, "time_forward": 0.415326, "time_loss": 0.000821}
[03/28 08:41:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "580", "eta": "4:51:38", "loss": 0.103538, "lr": 0.021020, "mode": "train", "time_backward": 1.070392, "time_data": 0.017736, "time_diff": 1.519471, "time_forward": 0.402409, "time_loss": 0.001299}
[03/28 08:42:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "590", "eta": "4:53:02", "loss": 0.088876, "lr": 0.021037, "mode": "train", "time_backward": 1.212118, "time_data": 10.372583, "time_diff": 12.435083, "time_forward": 0.842294, "time_loss": 0.000366}
[03/28 08:42:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "600", "eta": "4:52:30", "loss": 0.099069, "lr": 0.021053, "mode": "train", "time_backward": 1.059844, "time_data": 0.019509, "time_diff": 1.483413, "time_forward": 0.400634, "time_loss": 0.000375}
[03/28 08:43:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "610", "eta": "4:51:58", "loss": 0.092487, "lr": 0.021069, "mode": "train", "time_backward": 1.058574, "time_data": 0.017361, "time_diff": 1.487672, "time_forward": 0.401172, "time_loss": 0.000754}
[03/28 08:44:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "620", "eta": "4:57:20", "loss": 0.084564, "lr": 0.021086, "mode": "train", "time_backward": 1.059422, "time_data": 32.192521, "time_diff": 33.777181, "time_forward": 0.469594, "time_loss": 0.015209}
[03/28 08:44:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "630", "eta": "4:56:52", "loss": 0.097085, "lr": 0.021102, "mode": "train", "time_backward": 1.258328, "time_data": 0.039975, "time_diff": 2.023881, "time_forward": 0.720490, "time_loss": 0.000408}
[03/28 08:45:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "640", "eta": "4:56:20", "loss": 0.088613, "lr": 0.021118, "mode": "train", "time_backward": 1.056610, "time_data": 0.016803, "time_diff": 1.477464, "time_forward": 0.399583, "time_loss": 0.000239}
[03/28 08:45:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "650", "eta": "4:55:48", "loss": 0.090378, "lr": 0.021135, "mode": "train", "time_backward": 1.091459, "time_data": 0.017543, "time_diff": 1.510638, "time_forward": 0.399484, "time_loss": 0.000249}
[03/28 08:46:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "660", "eta": "4:55:15", "loss": 0.094942, "lr": 0.021151, "mode": "train", "time_backward": 1.056145, "time_data": 0.016995, "time_diff": 1.479346, "time_forward": 0.399330, "time_loss": 0.000331}
[03/28 08:46:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "670", "eta": "4:54:13", "loss": 0.088833, "lr": 0.021167, "mode": "train", "time_backward": 1.122016, "time_data": 0.017710, "time_diff": 1.539945, "time_forward": 0.398720, "time_loss": 0.000292}
[03/28 08:46:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "680", "eta": "4:53:41", "loss": 0.091383, "lr": 0.021184, "mode": "train", "time_backward": 1.123351, "time_data": 0.017635, "time_diff": 1.548440, "time_forward": 0.399418, "time_loss": 0.000380}
[03/28 08:47:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "690", "eta": "4:53:07", "loss": 0.091180, "lr": 0.021200, "mode": "train", "time_backward": 1.057772, "time_data": 0.017010, "time_diff": 1.478026, "time_forward": 0.399048, "time_loss": 0.000231}
[03/28 08:47:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "700", "eta": "4:54:09", "loss": 0.086105, "lr": 0.021216, "mode": "train", "time_backward": 1.132005, "time_data": 8.386112, "time_diff": 10.247153, "time_forward": 0.676956, "time_loss": 0.037108}
[03/28 08:48:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "710", "eta": "4:53:37", "loss": 0.091720, "lr": 0.021232, "mode": "train", "time_backward": 1.100595, "time_data": 0.019357, "time_diff": 1.528080, "time_forward": 0.400192, "time_loss": 0.000252}
[03/28 08:49:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "720", "eta": "4:53:01", "loss": 0.083868, "lr": 0.021249, "mode": "train", "time_backward": 1.093188, "time_data": 0.017036, "time_diff": 1.517523, "time_forward": 0.398953, "time_loss": 0.000256}
[03/28 08:49:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "730", "eta": "4:52:28", "loss": 0.092411, "lr": 0.021265, "mode": "train", "time_backward": 1.062774, "time_data": 0.016721, "time_diff": 1.485657, "time_forward": 0.397906, "time_loss": 0.000215}
[03/28 08:50:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "740", "eta": "4:55:21", "loss": 0.087592, "lr": 0.021281, "mode": "train", "time_backward": 20.137687, "time_data": 0.017292, "time_diff": 20.595552, "time_forward": 0.399528, "time_loss": 0.000296}
[03/28 08:50:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "750", "eta": "4:52:49", "loss": 0.083365, "lr": 0.021298, "mode": "train", "time_backward": 1.056190, "time_data": 0.019040, "time_diff": 1.480692, "time_forward": 0.399387, "time_loss": 0.000264}
[03/28 08:51:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "760", "eta": "4:52:17", "loss": 0.091078, "lr": 0.021314, "mode": "train", "time_backward": 1.123186, "time_data": 0.017090, "time_diff": 1.561346, "time_forward": 0.400678, "time_loss": 0.000259}
[03/28 08:51:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "770", "eta": "4:51:43", "loss": 0.076067, "lr": 0.021330, "mode": "train", "time_backward": 1.061321, "time_data": 0.018040, "time_diff": 1.520010, "time_forward": 0.400345, "time_loss": 0.000387}
[03/28 08:51:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "780", "eta": "4:51:09", "loss": 0.089963, "lr": 0.021347, "mode": "train", "time_backward": 1.065866, "time_data": 0.017122, "time_diff": 1.486408, "time_forward": 0.399873, "time_loss": 0.000284}
[03/28 08:51:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "790", "eta": "4:50:05", "loss": 0.096050, "lr": 0.021363, "mode": "train", "time_backward": 1.055241, "time_data": 0.018251, "time_diff": 1.528024, "time_forward": 0.451223, "time_loss": 0.000423}
[03/28 08:52:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "800", "eta": "4:43:34", "loss": 0.078259, "lr": 0.021379, "mode": "train", "time_backward": 1.108165, "time_data": 0.024189, "time_diff": 1.624338, "time_forward": 0.479871, "time_loss": 0.000902}
[03/28 08:52:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "810", "eta": "4:43:03", "loss": 0.079018, "lr": 0.021396, "mode": "train", "time_backward": 1.080988, "time_data": 0.016953, "time_diff": 1.514913, "time_forward": 0.409237, "time_loss": 0.001369}
[03/28 08:52:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "820", "eta": "4:43:04", "loss": 0.092612, "lr": 0.021412, "mode": "train", "time_backward": 4.217883, "time_data": 0.018379, "time_diff": 4.698796, "time_forward": 0.401951, "time_loss": 0.000247}
[03/28 08:53:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "830", "eta": "4:42:32", "loss": 0.087073, "lr": 0.021428, "mode": "train", "time_backward": 1.052218, "time_data": 0.016874, "time_diff": 1.477119, "time_forward": 0.397073, "time_loss": 0.000206}
[03/28 08:54:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "840", "eta": "4:42:00", "loss": 0.093590, "lr": 0.021445, "mode": "train", "time_backward": 1.054695, "time_data": 0.016816, "time_diff": 1.477041, "time_forward": 0.399172, "time_loss": 0.000254}
[03/28 08:54:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "850", "eta": "4:40:59", "loss": 0.097590, "lr": 0.021461, "mode": "train", "time_backward": 1.059191, "time_data": 0.017445, "time_diff": 1.483388, "time_forward": 0.398930, "time_loss": 0.000356}
[03/28 08:55:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "860", "eta": "4:40:27", "loss": 0.090299, "lr": 0.021477, "mode": "train", "time_backward": 1.069488, "time_data": 0.017364, "time_diff": 1.489558, "time_forward": 0.399392, "time_loss": 0.000325}
[03/28 08:55:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "870", "eta": "4:39:36", "loss": 0.083336, "lr": 0.021494, "mode": "train", "time_backward": 1.057045, "time_data": 0.016974, "time_diff": 1.513578, "time_forward": 0.398349, "time_loss": 0.000300}
[03/28 08:55:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "880", "eta": "4:39:04", "loss": 0.090933, "lr": 0.021510, "mode": "train", "time_backward": 1.071801, "time_data": 0.020063, "time_diff": 1.498420, "time_forward": 0.399140, "time_loss": 0.000209}
[03/28 08:56:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "890", "eta": "4:38:32", "loss": 0.084831, "lr": 0.021526, "mode": "train", "time_backward": 1.056514, "time_data": 0.017293, "time_diff": 1.480934, "time_forward": 0.398917, "time_loss": 0.000276}
[03/28 08:56:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "900", "eta": "4:36:22", "loss": 0.083601, "lr": 0.021543, "mode": "train", "time_backward": 1.057517, "time_data": 0.028079, "time_diff": 1.495209, "time_forward": 0.405722, "time_loss": 0.000439}
[03/28 08:57:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "910", "eta": "4:35:50", "loss": 0.087742, "lr": 0.021559, "mode": "train", "time_backward": 1.054670, "time_data": 0.019379, "time_diff": 1.480860, "time_forward": 0.403194, "time_loss": 0.000345}
[03/28 08:58:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "920", "eta": "4:36:46", "loss": 0.093084, "lr": 0.021575, "mode": "train", "time_backward": 12.027985, "time_data": 0.017134, "time_diff": 12.450190, "time_forward": 0.398933, "time_loss": 0.000250}
[03/28 08:58:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "930", "eta": "4:36:14", "loss": 0.086755, "lr": 0.021592, "mode": "train", "time_backward": 1.082831, "time_data": 0.018277, "time_diff": 1.506443, "time_forward": 0.398265, "time_loss": 0.000269}
[03/28 08:58:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "940", "eta": "4:35:43", "loss": 0.089928, "lr": 0.021608, "mode": "train", "time_backward": 1.111248, "time_data": 0.017117, "time_diff": 1.534709, "time_forward": 0.399481, "time_loss": 0.000323}
[03/28 08:58:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "950", "eta": "4:35:11", "loss": 0.086605, "lr": 0.021624, "mode": "train", "time_backward": 1.063918, "time_data": 0.018985, "time_diff": 1.490588, "time_forward": 0.398792, "time_loss": 0.000264}
[03/28 08:59:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "960", "eta": "4:34:39", "loss": 0.086218, "lr": 0.021641, "mode": "train", "time_backward": 1.066066, "time_data": 0.016853, "time_diff": 1.489817, "time_forward": 0.399530, "time_loss": 0.000252}
[03/28 08:59:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "970", "eta": "4:34:07", "loss": 0.091788, "lr": 0.021657, "mode": "train", "time_backward": 1.131069, "time_data": 0.018282, "time_diff": 1.556654, "time_forward": 0.399581, "time_loss": 0.000273}
[03/28 09:00:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "980", "eta": "4:23:58", "loss": 0.089213, "lr": 0.021673, "mode": "train", "time_backward": 1.122640, "time_data": 0.024108, "time_diff": 1.566352, "time_forward": 0.399122, "time_loss": 0.000258}
[03/28 09:00:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "990", "eta": "4:23:28", "loss": 0.092934, "lr": 0.021690, "mode": "train", "time_backward": 1.058454, "time_data": 0.016636, "time_diff": 1.478878, "time_forward": 0.397725, "time_loss": 0.000205}
[03/28 09:00:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1000", "eta": "4:23:16", "loss": 0.091513, "lr": 0.021706, "mode": "train", "time_backward": 2.907795, "time_data": 0.017041, "time_diff": 3.448305, "time_forward": 0.484196, "time_loss": 0.000412}
[03/28 09:00:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1010", "eta": "4:22:47", "loss": 0.088717, "lr": 0.021722, "mode": "train", "time_backward": 1.153654, "time_data": 0.037636, "time_diff": 1.631217, "time_forward": 0.399795, "time_loss": 0.000373}
[03/28 09:01:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1020", "eta": "4:22:16", "loss": 0.085006, "lr": 0.021739, "mode": "train", "time_backward": 1.081220, "time_data": 0.017623, "time_diff": 1.508033, "time_forward": 0.401677, "time_loss": 0.000264}
[03/28 09:01:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1030", "eta": "4:21:45", "loss": 0.083605, "lr": 0.021755, "mode": "train", "time_backward": 1.057178, "time_data": 0.016821, "time_diff": 1.484466, "time_forward": 0.402096, "time_loss": 0.000886}
[03/28 09:01:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1040", "eta": "4:21:15", "loss": 0.093639, "lr": 0.021771, "mode": "train", "time_backward": 1.068554, "time_data": 0.019694, "time_diff": 1.585246, "time_forward": 0.407686, "time_loss": 0.000338}
[03/28 09:02:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1050", "eta": "4:20:44", "loss": 0.087706, "lr": 0.021788, "mode": "train", "time_backward": 1.059001, "time_data": 0.017468, "time_diff": 1.480539, "time_forward": 0.400268, "time_loss": 0.000363}
[03/28 09:02:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1060", "eta": "4:15:48", "loss": 0.084660, "lr": 0.021804, "mode": "train", "time_backward": 1.101374, "time_data": 0.016897, "time_diff": 1.525939, "time_forward": 0.401475, "time_loss": 0.000676}
[03/28 09:03:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1070", "eta": "4:15:18", "loss": 0.084324, "lr": 0.021820, "mode": "train", "time_backward": 1.057126, "time_data": 0.018407, "time_diff": 1.478324, "time_forward": 0.399462, "time_loss": 0.000250}
[03/28 09:03:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1080", "eta": "4:14:48", "loss": 0.095274, "lr": 0.021837, "mode": "train", "time_backward": 1.090988, "time_data": 0.016766, "time_diff": 1.512620, "time_forward": 0.399739, "time_loss": 0.000424}
[03/28 09:04:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1090", "eta": "4:14:17", "loss": 0.082301, "lr": 0.021853, "mode": "train", "time_backward": 1.057265, "time_data": 0.016906, "time_diff": 1.479001, "time_forward": 0.399472, "time_loss": 0.000328}
[03/28 09:04:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1100", "eta": "4:13:47", "loss": 0.083181, "lr": 0.021869, "mode": "train", "time_backward": 1.088718, "time_data": 0.018884, "time_diff": 1.518768, "time_forward": 0.404239, "time_loss": 0.001020}
[03/28 09:04:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1110", "eta": "4:13:16", "loss": 0.086828, "lr": 0.021886, "mode": "train", "time_backward": 1.055358, "time_data": 0.017450, "time_diff": 1.476544, "time_forward": 0.399411, "time_loss": 0.000340}
[03/28 09:05:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1120", "eta": "4:12:46", "loss": 0.100425, "lr": 0.021902, "mode": "train", "time_backward": 1.100626, "time_data": 0.017515, "time_diff": 1.526347, "time_forward": 0.400072, "time_loss": 0.000257}
[03/28 09:06:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1130", "eta": "4:12:16", "loss": 0.086534, "lr": 0.021918, "mode": "train", "time_backward": 1.057571, "time_data": 0.023062, "time_diff": 1.488366, "time_forward": 0.404295, "time_loss": 0.000260}
[03/28 09:06:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1140", "eta": "4:06:13", "loss": 0.082382, "lr": 0.021934, "mode": "train", "time_backward": 1.057721, "time_data": 0.017272, "time_diff": 1.481327, "time_forward": 0.399159, "time_loss": 0.000304}
[03/28 09:07:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1150", "eta": "4:05:43", "loss": 0.076844, "lr": 0.021951, "mode": "train", "time_backward": 1.057475, "time_data": 0.017137, "time_diff": 1.486272, "time_forward": 0.400001, "time_loss": 0.000244}
[03/28 09:08:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1160", "eta": "4:07:55", "loss": 0.079672, "lr": 0.021967, "mode": "train", "time_backward": 17.444121, "time_data": 0.020873, "time_diff": 17.875440, "time_forward": 0.407295, "time_loss": 0.000261}
[03/28 09:08:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1170", "eta": "4:07:25", "loss": 0.083844, "lr": 0.021983, "mode": "train", "time_backward": 1.057609, "time_data": 0.016758, "time_diff": 1.478276, "time_forward": 0.400314, "time_loss": 0.000474}
[03/28 09:08:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1180", "eta": "4:06:56", "loss": 0.082433, "lr": 0.022000, "mode": "train", "time_backward": 1.117220, "time_data": 0.018215, "time_diff": 1.556073, "time_forward": 0.399290, "time_loss": 0.000346}
[03/28 09:08:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1190", "eta": "4:06:25", "loss": 0.087901, "lr": 0.022016, "mode": "train", "time_backward": 1.056945, "time_data": 0.020920, "time_diff": 1.482130, "time_forward": 0.399367, "time_loss": 0.000247}
[03/28 09:09:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1200", "eta": "4:08:35", "loss": 0.098879, "lr": 0.022032, "mode": "train", "time_backward": 1.245345, "time_data": 15.727960, "time_diff": 17.834522, "time_forward": 0.852786, "time_loss": 0.000400}
[03/28 09:09:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1210", "eta": "4:08:06", "loss": 0.090581, "lr": 0.022049, "mode": "train", "time_backward": 1.126314, "time_data": 0.018029, "time_diff": 1.594850, "time_forward": 0.432110, "time_loss": 0.000265}
[03/28 09:10:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1220", "eta": "4:07:36", "loss": 0.082921, "lr": 0.022065, "mode": "train", "time_backward": 1.054749, "time_data": 0.017785, "time_diff": 1.477522, "time_forward": 0.400075, "time_loss": 0.000419}
[03/28 09:10:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1230", "eta": "4:07:04", "loss": 0.106990, "lr": 0.022081, "mode": "train", "time_backward": 1.056438, "time_data": 0.016997, "time_diff": 1.479083, "time_forward": 0.399438, "time_loss": 0.000277}
[03/28 09:11:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1240", "eta": "4:09:18", "loss": 0.080252, "lr": 0.022098, "mode": "train", "time_backward": 17.784941, "time_data": 0.017091, "time_diff": 18.328987, "time_forward": 0.398879, "time_loss": 0.000251}
[03/28 09:11:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1250", "eta": "4:08:47", "loss": 0.081497, "lr": 0.022114, "mode": "train", "time_backward": 1.080510, "time_data": 0.018040, "time_diff": 1.504652, "time_forward": 0.402146, "time_loss": 0.000270}
[03/28 09:11:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1260", "eta": "4:08:16", "loss": 0.085538, "lr": 0.022130, "mode": "train", "time_backward": 1.084356, "time_data": 0.016783, "time_diff": 1.506042, "time_forward": 0.399215, "time_loss": 0.000252}
[03/28 09:11:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1270", "eta": "4:07:47", "loss": 0.086229, "lr": 0.022147, "mode": "train", "time_backward": 1.065621, "time_data": 0.017588, "time_diff": 1.622900, "time_forward": 0.407430, "time_loss": 0.000390}
[03/28 09:12:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1280", "eta": "4:07:16", "loss": 0.088624, "lr": 0.022163, "mode": "train", "time_backward": 1.057344, "time_data": 0.017577, "time_diff": 1.529068, "time_forward": 0.450605, "time_loss": 0.000296}
[03/28 09:12:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1290", "eta": "4:06:49", "loss": 0.090286, "lr": 0.022179, "mode": "train", "time_backward": 1.369125, "time_data": 0.016863, "time_diff": 1.834018, "time_forward": 0.440890, "time_loss": 0.000576}
[03/28 09:13:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1300", "eta": "4:06:19", "loss": 0.090368, "lr": 0.022196, "mode": "train", "time_backward": 1.126806, "time_data": 0.069080, "time_diff": 1.609082, "time_forward": 0.400428, "time_loss": 0.000359}
[03/28 09:13:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1310", "eta": "4:05:49", "loss": 0.095707, "lr": 0.022212, "mode": "train", "time_backward": 1.071296, "time_data": 0.019587, "time_diff": 1.519114, "time_forward": 0.412210, "time_loss": 0.000389}
[03/28 09:13:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1320", "eta": "4:05:18", "loss": 0.080042, "lr": 0.022228, "mode": "train", "time_backward": 1.060849, "time_data": 0.017005, "time_diff": 1.489893, "time_forward": 0.399998, "time_loss": 0.000755}
[03/28 09:14:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1330", "eta": "4:04:48", "loss": 0.085868, "lr": 0.022245, "mode": "train", "time_backward": 1.062099, "time_data": 0.018726, "time_diff": 1.565620, "time_forward": 0.481182, "time_loss": 0.000353}
[03/28 09:14:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1340", "eta": "4:06:19", "loss": 0.082445, "lr": 0.022261, "mode": "train", "time_backward": 13.861831, "time_data": 0.027268, "time_diff": 14.300714, "time_forward": 0.404609, "time_loss": 0.000490}
[03/28 09:15:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1350", "eta": "4:05:48", "loss": 0.089157, "lr": 0.022277, "mode": "train", "time_backward": 1.056501, "time_data": 0.017308, "time_diff": 1.528924, "time_forward": 0.451398, "time_loss": 0.000348}
[03/28 09:16:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1360", "eta": "4:05:18", "loss": 0.088211, "lr": 0.022294, "mode": "train", "time_backward": 1.142398, "time_data": 0.017657, "time_diff": 1.567168, "time_forward": 0.399175, "time_loss": 0.000285}
[03/28 09:16:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1370", "eta": "4:04:47", "loss": 0.077919, "lr": 0.022310, "mode": "train", "time_backward": 1.058071, "time_data": 0.016843, "time_diff": 1.480822, "time_forward": 0.398920, "time_loss": 0.000279}
[03/28 09:17:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1380", "eta": "4:04:15", "loss": 0.086549, "lr": 0.022326, "mode": "train", "time_backward": 1.054062, "time_data": 0.016750, "time_diff": 1.476557, "time_forward": 0.397967, "time_loss": 0.000460}
[03/28 09:17:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1390", "eta": "4:03:45", "loss": 0.094996, "lr": 0.022343, "mode": "train", "time_backward": 1.069994, "time_data": 0.018686, "time_diff": 1.497786, "time_forward": 0.405579, "time_loss": 0.000343}
[03/28 09:18:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1400", "eta": "4:03:14", "loss": 0.083348, "lr": 0.022359, "mode": "train", "time_backward": 1.092178, "time_data": 0.017592, "time_diff": 1.517945, "time_forward": 0.399977, "time_loss": 0.000442}
[03/28 09:18:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1410", "eta": "4:02:43", "loss": 0.084026, "lr": 0.022375, "mode": "train", "time_backward": 1.056158, "time_data": 0.017152, "time_diff": 1.478867, "time_forward": 0.398608, "time_loss": 0.000249}
[03/28 09:18:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1420", "eta": "4:02:11", "loss": 0.079650, "lr": 0.022392, "mode": "train", "time_backward": 1.091229, "time_data": 0.017729, "time_diff": 1.511643, "time_forward": 0.399226, "time_loss": 0.000271}
[03/28 09:19:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1430", "eta": "4:01:40", "loss": 0.088006, "lr": 0.022408, "mode": "train", "time_backward": 1.056820, "time_data": 0.033039, "time_diff": 1.503617, "time_forward": 0.406885, "time_loss": 0.000372}
[03/28 09:19:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1440", "eta": "4:00:59", "loss": 0.083232, "lr": 0.022424, "mode": "train", "time_backward": 1.069773, "time_data": 0.016649, "time_diff": 1.515601, "time_forward": 0.399143, "time_loss": 0.000306}
[03/28 09:19:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1450", "eta": "3:58:46", "loss": 0.088458, "lr": 0.022441, "mode": "train", "time_backward": 1.055715, "time_data": 0.018080, "time_diff": 1.478866, "time_forward": 0.401174, "time_loss": 0.000673}
[03/28 09:20:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1460", "eta": "3:58:15", "loss": 0.083908, "lr": 0.022457, "mode": "train", "time_backward": 1.055885, "time_data": 0.021915, "time_diff": 1.494134, "time_forward": 0.412802, "time_loss": 0.000299}
[03/28 09:20:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1470", "eta": "3:57:44", "loss": 0.076834, "lr": 0.022473, "mode": "train", "time_backward": 1.064145, "time_data": 0.017754, "time_diff": 1.498225, "time_forward": 0.398337, "time_loss": 0.000240}
[03/28 09:21:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1480", "eta": "3:57:10", "loss": 0.087060, "lr": 0.022490, "mode": "train", "time_backward": 1.056964, "time_data": 0.018046, "time_diff": 1.493795, "time_forward": 0.411776, "time_loss": 0.003744}
[03/28 09:21:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1490", "eta": "3:58:18", "loss": 0.100548, "lr": 0.022506, "mode": "train", "time_backward": 1.056933, "time_data": 10.732672, "time_diff": 12.214873, "time_forward": 0.415834, "time_loss": 0.000862}
[03/28 09:21:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1500", "eta": "3:57:47", "loss": 0.093273, "lr": 0.022522, "mode": "train", "time_backward": 1.065698, "time_data": 0.017388, "time_diff": 1.489914, "time_forward": 0.399437, "time_loss": 0.000413}
[03/28 09:22:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1510", "eta": "3:57:18", "loss": 0.102156, "lr": 0.022539, "mode": "train", "time_backward": 1.056773, "time_data": 0.017523, "time_diff": 1.659388, "time_forward": 0.578394, "time_loss": 0.000306}
[03/28 09:22:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1520", "eta": "3:56:47", "loss": 0.088227, "lr": 0.022555, "mode": "train", "time_backward": 1.085904, "time_data": 0.018010, "time_diff": 1.521261, "time_forward": 0.404611, "time_loss": 0.000299}
[03/28 09:23:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1530", "eta": "3:56:15", "loss": 0.083824, "lr": 0.022571, "mode": "train", "time_backward": 1.058718, "time_data": 0.017161, "time_diff": 1.479522, "time_forward": 0.398598, "time_loss": 0.000254}
[03/28 09:23:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1540", "eta": "3:55:48", "loss": 0.090563, "lr": 0.022588, "mode": "train", "time_backward": 1.228612, "time_data": 0.293490, "time_diff": 2.017460, "time_forward": 0.462348, "time_loss": 0.021714}
[03/28 09:24:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1550", "eta": "3:55:22", "loss": 0.080278, "lr": 0.022604, "mode": "train", "time_backward": 1.262012, "time_data": 0.017923, "time_diff": 2.120129, "time_forward": 0.423392, "time_loss": 0.000253}
[03/28 09:24:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1560", "eta": "3:54:51", "loss": 0.084619, "lr": 0.022620, "mode": "train", "time_backward": 1.100006, "time_data": 0.017235, "time_diff": 1.555582, "time_forward": 0.399832, "time_loss": 0.000321}
[03/28 09:25:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1570", "eta": "3:58:11", "loss": 0.082469, "lr": 0.022637, "mode": "train", "time_backward": 1.173978, "time_data": 24.788069, "time_diff": 26.906712, "time_forward": 0.888777, "time_loss": 0.000419}
[03/28 09:25:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1580", "eta": "3:57:40", "loss": 0.077986, "lr": 0.022653, "mode": "train", "time_backward": 1.056492, "time_data": 0.017215, "time_diff": 1.484210, "time_forward": 0.400916, "time_loss": 0.000790}
[03/28 09:26:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1590", "eta": "4:07:41", "loss": 0.098178, "lr": 0.022669, "mode": "train", "time_backward": 71.002703, "time_data": 0.016971, "time_diff": 71.467988, "time_forward": 0.398656, "time_loss": 0.000285}
[03/28 09:27:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1600", "eta": "4:07:08", "loss": 0.093959, "lr": 0.022685, "mode": "train", "time_backward": 1.054565, "time_data": 0.017893, "time_diff": 1.477034, "time_forward": 0.400833, "time_loss": 0.000311}
[03/28 09:27:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1610", "eta": "4:06:35", "loss": 0.089542, "lr": 0.022702, "mode": "train", "time_backward": 1.092692, "time_data": 0.017451, "time_diff": 1.520273, "time_forward": 0.403754, "time_loss": 0.003403}
[03/28 09:27:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1620", "eta": "4:03:21", "loss": 0.087010, "lr": 0.022718, "mode": "train", "time_backward": 1.054749, "time_data": 0.029300, "time_diff": 1.546671, "time_forward": 0.410219, "time_loss": 0.000393}
[03/28 09:27:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1630", "eta": "4:01:11", "loss": 0.084963, "lr": 0.022734, "mode": "train", "time_backward": 1.195347, "time_data": 0.017263, "time_diff": 1.906248, "time_forward": 0.685502, "time_loss": 0.001048}
[03/28 09:28:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1640", "eta": "4:00:39", "loss": 0.080778, "lr": 0.022751, "mode": "train", "time_backward": 1.054803, "time_data": 0.019605, "time_diff": 1.481770, "time_forward": 0.399987, "time_loss": 0.000641}
[03/28 09:29:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1650", "eta": "4:03:12", "loss": 0.093355, "lr": 0.022767, "mode": "train", "time_backward": 1.104643, "time_data": 20.510814, "time_diff": 22.305995, "time_forward": 0.671359, "time_loss": 0.000435}
[03/28 09:29:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1660", "eta": "4:02:39", "loss": 0.078721, "lr": 0.022783, "mode": "train", "time_backward": 1.084654, "time_data": 0.016890, "time_diff": 1.503044, "time_forward": 0.398255, "time_loss": 0.000299}
[03/28 09:29:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1670", "eta": "4:02:13", "loss": 0.087243, "lr": 0.022800, "mode": "train", "time_backward": 1.245385, "time_data": 0.017151, "time_diff": 2.284188, "time_forward": 0.946330, "time_loss": 0.013409}
[03/28 09:29:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1680", "eta": "4:01:17", "loss": 0.084866, "lr": 0.022816, "mode": "train", "time_backward": 1.053468, "time_data": 0.017083, "time_diff": 1.484443, "time_forward": 0.402520, "time_loss": 0.000504}
[03/28 09:30:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1690", "eta": "4:00:44", "loss": 0.091713, "lr": 0.022832, "mode": "train", "time_backward": 1.056213, "time_data": 0.016880, "time_diff": 1.478270, "time_forward": 0.399012, "time_loss": 0.000279}
[03/28 09:31:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1700", "eta": "3:59:41", "loss": 0.080732, "lr": 0.022849, "mode": "train", "time_backward": 1.055405, "time_data": 0.018076, "time_diff": 1.478478, "time_forward": 0.398496, "time_loss": 0.000285}
[03/28 09:32:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1710", "eta": "3:59:09", "loss": 0.091940, "lr": 0.022865, "mode": "train", "time_backward": 1.074051, "time_data": 0.017230, "time_diff": 1.527604, "time_forward": 0.401941, "time_loss": 0.000789}
[03/28 09:33:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1720", "eta": "3:58:36", "loss": 0.080457, "lr": 0.022881, "mode": "train", "time_backward": 1.106358, "time_data": 0.016804, "time_diff": 1.525168, "time_forward": 0.398479, "time_loss": 0.000267}
[03/28 09:33:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1730", "eta": "3:58:04", "loss": 0.086597, "lr": 0.022898, "mode": "train", "time_backward": 1.054031, "time_data": 0.032673, "time_diff": 1.501793, "time_forward": 0.399669, "time_loss": 0.000345}
[03/28 09:33:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1740", "eta": "3:57:31", "loss": 0.080000, "lr": 0.022914, "mode": "train", "time_backward": 1.056682, "time_data": 0.016910, "time_diff": 1.478125, "time_forward": 0.400860, "time_loss": 0.000516}
[03/28 09:34:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1750", "eta": "3:56:58", "loss": 0.086710, "lr": 0.022930, "mode": "train", "time_backward": 1.071007, "time_data": 0.017066, "time_diff": 1.495992, "time_forward": 0.401481, "time_loss": 0.000294}
[03/28 09:34:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1760", "eta": "3:53:11", "loss": 0.095443, "lr": 0.022947, "mode": "train", "time_backward": 1.061338, "time_data": 0.016970, "time_diff": 1.485608, "time_forward": 0.399102, "time_loss": 0.000241}
[03/28 09:35:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1770", "eta": "3:52:40", "loss": 0.085497, "lr": 0.022963, "mode": "train", "time_backward": 1.129293, "time_data": 0.016909, "time_diff": 1.561894, "time_forward": 0.401372, "time_loss": 0.000302}
[03/28 09:35:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1780", "eta": "3:46:26", "loss": 0.082589, "lr": 0.022979, "mode": "train", "time_backward": 1.083390, "time_data": 0.017362, "time_diff": 1.510154, "time_forward": 0.401787, "time_loss": 0.000275}
[03/28 09:35:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1790", "eta": "3:45:55", "loss": 0.093356, "lr": 0.022996, "mode": "train", "time_backward": 1.135989, "time_data": 0.017789, "time_diff": 1.569379, "time_forward": 0.401078, "time_loss": 0.000840}
[03/28 09:36:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1800", "eta": "3:46:00", "loss": 0.082021, "lr": 0.023012, "mode": "train", "time_backward": 5.297804, "time_data": 0.017922, "time_diff": 5.722058, "time_forward": 0.399465, "time_loss": 0.000249}
[03/28 09:36:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1810", "eta": "3:45:29", "loss": 0.090757, "lr": 0.023028, "mode": "train", "time_backward": 1.151410, "time_data": 0.017932, "time_diff": 1.584210, "time_forward": 0.407667, "time_loss": 0.000364}
[03/28 09:37:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1820", "eta": "3:44:58", "loss": 0.087558, "lr": 0.023045, "mode": "train", "time_backward": 1.088130, "time_data": 0.061903, "time_diff": 1.558699, "time_forward": 0.401591, "time_loss": 0.000270}
[03/28 09:37:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1830", "eta": "3:44:27", "loss": 0.084464, "lr": 0.023061, "mode": "train", "time_backward": 1.067639, "time_data": 0.017100, "time_diff": 1.514525, "time_forward": 0.398970, "time_loss": 0.000280}
[03/28 09:37:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1840", "eta": "3:43:55", "loss": 0.092720, "lr": 0.023077, "mode": "train", "time_backward": 1.130893, "time_data": 0.017975, "time_diff": 1.555479, "time_forward": 0.400056, "time_loss": 0.000355}
[03/28 09:38:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1850", "eta": "3:43:24", "loss": 0.091989, "lr": 0.023094, "mode": "train", "time_backward": 1.052597, "time_data": 0.016816, "time_diff": 1.474650, "time_forward": 0.398031, "time_loss": 0.000278}
[03/28 09:38:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1860", "eta": "3:33:29", "loss": 0.082401, "lr": 0.023110, "mode": "train", "time_backward": 1.106345, "time_data": 0.017017, "time_diff": 1.525544, "time_forward": 0.398991, "time_loss": 0.000350}
[03/28 09:38:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1870", "eta": "3:32:58", "loss": 0.092626, "lr": 0.023126, "mode": "train", "time_backward": 1.068364, "time_data": 0.018316, "time_diff": 1.503642, "time_forward": 0.416294, "time_loss": 0.000350}
[03/28 09:39:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1880", "eta": "3:32:28", "loss": 0.096977, "lr": 0.023143, "mode": "train", "time_backward": 1.059216, "time_data": 0.017143, "time_diff": 1.518854, "time_forward": 0.438494, "time_loss": 0.000740}
[03/28 09:40:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1890", "eta": "3:34:32", "loss": 0.089984, "lr": 0.023159, "mode": "train", "time_backward": 19.437624, "time_data": 0.016841, "time_diff": 19.859268, "time_forward": 0.398530, "time_loss": 0.000248}
[03/28 09:40:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1900", "eta": "3:34:01", "loss": 0.084340, "lr": 0.023175, "mode": "train", "time_backward": 1.065020, "time_data": 0.019228, "time_diff": 1.490152, "time_forward": 0.402511, "time_loss": 0.000252}
[03/28 09:40:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1910", "eta": "3:33:30", "loss": 0.086990, "lr": 0.023192, "mode": "train", "time_backward": 1.058150, "time_data": 0.017606, "time_diff": 1.486908, "time_forward": 0.407189, "time_loss": 0.000278}
[03/28 09:41:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1920", "eta": "3:33:00", "loss": 0.093095, "lr": 0.023208, "mode": "train", "time_backward": 1.058037, "time_data": 0.019239, "time_diff": 1.481536, "time_forward": 0.398163, "time_loss": 0.000248}
[03/28 09:42:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1930", "eta": "3:32:30", "loss": 0.084506, "lr": 0.023224, "mode": "train", "time_backward": 1.077678, "time_data": 0.017527, "time_diff": 1.566785, "time_forward": 0.399571, "time_loss": 0.001428}
[03/28 09:42:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1940", "eta": "3:30:10", "loss": 0.089757, "lr": 0.023241, "mode": "train", "time_backward": 1.055596, "time_data": 0.017434, "time_diff": 1.480889, "time_forward": 0.399863, "time_loss": 0.000377}
[03/28 09:43:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1950", "eta": "3:29:41", "loss": 0.084817, "lr": 0.023257, "mode": "train", "time_backward": 1.057621, "time_data": 0.017543, "time_diff": 1.569466, "time_forward": 0.490653, "time_loss": 0.000451}
[03/28 09:43:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1960", "eta": "3:29:11", "loss": 0.081329, "lr": 0.023273, "mode": "train", "time_backward": 1.102504, "time_data": 0.018159, "time_diff": 1.559263, "time_forward": 0.402482, "time_loss": 0.000260}
[03/28 09:45:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1970", "eta": "3:28:40", "loss": 0.076120, "lr": 0.023290, "mode": "train", "time_backward": 1.095112, "time_data": 0.017896, "time_diff": 1.523926, "time_forward": 0.403022, "time_loss": 0.000468}
[03/28 09:45:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1980", "eta": "3:27:41", "loss": 0.079254, "lr": 0.023306, "mode": "train", "time_backward": 1.060582, "time_data": 0.016947, "time_diff": 1.484241, "time_forward": 0.398379, "time_loss": 0.000240}
[03/28 09:45:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "1990", "eta": "3:27:12", "loss": 0.083790, "lr": 0.023322, "mode": "train", "time_backward": 1.109987, "time_data": 0.017366, "time_diff": 1.552707, "time_forward": 0.417095, "time_loss": 0.000373}
[03/28 09:45:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2000", "eta": "3:26:48", "loss": 0.081382, "lr": 0.023339, "mode": "train", "time_backward": 1.059790, "time_data": 0.019088, "time_diff": 2.330002, "time_forward": 1.243293, "time_loss": 0.000456}
[03/28 09:46:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2010", "eta": "3:26:17", "loss": 0.084543, "lr": 0.023355, "mode": "train", "time_backward": 1.058106, "time_data": 0.018424, "time_diff": 1.484430, "time_forward": 0.399413, "time_loss": 0.000258}
[03/28 09:46:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2020", "eta": "3:24:28", "loss": 0.086351, "lr": 0.023371, "mode": "train", "time_backward": 1.062501, "time_data": 0.017875, "time_diff": 1.482357, "time_forward": 0.399453, "time_loss": 0.000275}
[03/28 09:47:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2030", "eta": "3:23:58", "loss": 0.083786, "lr": 0.023387, "mode": "train", "time_backward": 1.065734, "time_data": 0.017941, "time_diff": 1.490360, "time_forward": 0.399572, "time_loss": 0.000292}
[03/28 09:48:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2040", "eta": "3:23:28", "loss": 0.079386, "lr": 0.023404, "mode": "train", "time_backward": 1.056759, "time_data": 0.016847, "time_diff": 1.479996, "time_forward": 0.398289, "time_loss": 0.000310}
[03/28 09:48:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2050", "eta": "3:22:58", "loss": 0.093855, "lr": 0.023420, "mode": "train", "time_backward": 1.055243, "time_data": 0.017483, "time_diff": 1.479549, "time_forward": 0.399195, "time_loss": 0.000296}
[03/28 09:49:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2060", "eta": "3:22:28", "loss": 0.084552, "lr": 0.023436, "mode": "train", "time_backward": 1.052872, "time_data": 0.017586, "time_diff": 1.474542, "time_forward": 0.398227, "time_loss": 0.000222}
[03/28 09:49:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2070", "eta": "3:21:58", "loss": 0.092500, "lr": 0.023453, "mode": "train", "time_backward": 1.055557, "time_data": 0.017125, "time_diff": 1.483963, "time_forward": 0.399903, "time_loss": 0.000406}
[03/28 09:49:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2080", "eta": "3:21:28", "loss": 0.088807, "lr": 0.023469, "mode": "train", "time_backward": 1.065779, "time_data": 0.016831, "time_diff": 1.507283, "time_forward": 0.402349, "time_loss": 0.000315}
[03/28 09:50:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2090", "eta": "3:20:57", "loss": 0.078585, "lr": 0.023485, "mode": "train", "time_backward": 1.056680, "time_data": 0.017372, "time_diff": 1.478421, "time_forward": 0.399826, "time_loss": 0.000225}
[03/28 09:50:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2100", "eta": "3:19:44", "loss": 0.086616, "lr": 0.023502, "mode": "train", "time_backward": 1.085974, "time_data": 0.016690, "time_diff": 1.516298, "time_forward": 0.398539, "time_loss": 0.000234}
[03/28 09:51:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2110", "eta": "3:19:13", "loss": 0.074039, "lr": 0.023518, "mode": "train", "time_backward": 1.056236, "time_data": 0.019232, "time_diff": 1.483806, "time_forward": 0.399211, "time_loss": 0.000296}
[03/28 09:52:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2120", "eta": "3:18:44", "loss": 0.083550, "lr": 0.023534, "mode": "train", "time_backward": 1.103712, "time_data": 0.017341, "time_diff": 1.529461, "time_forward": 0.400759, "time_loss": 0.000377}
[03/28 09:52:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2130", "eta": "3:18:14", "loss": 0.082719, "lr": 0.023551, "mode": "train", "time_backward": 1.056862, "time_data": 0.017295, "time_diff": 1.479171, "time_forward": 0.399154, "time_loss": 0.000244}
[03/28 09:53:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2140", "eta": "3:17:42", "loss": 0.086460, "lr": 0.023567, "mode": "train", "time_backward": 1.065257, "time_data": 0.018634, "time_diff": 1.539361, "time_forward": 0.443917, "time_loss": 0.000417}
[03/28 09:53:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2150", "eta": "3:17:13", "loss": 0.092950, "lr": 0.023583, "mode": "train", "time_backward": 1.062971, "time_data": 0.017194, "time_diff": 1.496918, "time_forward": 0.401650, "time_loss": 0.000663}
[03/28 09:53:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2160", "eta": "3:16:43", "loss": 0.087694, "lr": 0.023600, "mode": "train", "time_backward": 1.102107, "time_data": 0.016988, "time_diff": 1.528505, "time_forward": 0.401026, "time_loss": 0.000938}
[03/28 09:54:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2170", "eta": "3:16:13", "loss": 0.084635, "lr": 0.023616, "mode": "train", "time_backward": 1.054582, "time_data": 0.017699, "time_diff": 1.481157, "time_forward": 0.404053, "time_loss": 0.000272}
[03/28 09:54:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2180", "eta": "3:15:43", "loss": 0.089567, "lr": 0.023632, "mode": "train", "time_backward": 1.069403, "time_data": 0.020921, "time_diff": 1.510211, "time_forward": 0.413102, "time_loss": 0.000499}
[03/28 09:55:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2190", "eta": "3:15:13", "loss": 0.081615, "lr": 0.023649, "mode": "train", "time_backward": 1.052118, "time_data": 0.016562, "time_diff": 1.473467, "time_forward": 0.397249, "time_loss": 0.000198}
[03/28 09:55:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2200", "eta": "3:14:44", "loss": 0.087638, "lr": 0.023665, "mode": "train", "time_backward": 1.112485, "time_data": 0.017114, "time_diff": 1.532276, "time_forward": 0.401723, "time_loss": 0.000383}
[03/28 09:56:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2210", "eta": "3:14:13", "loss": 0.089447, "lr": 0.023681, "mode": "train", "time_backward": 1.055278, "time_data": 0.019702, "time_diff": 1.480038, "time_forward": 0.398035, "time_loss": 0.000257}
[03/28 09:56:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2220", "eta": "3:12:28", "loss": 0.089601, "lr": 0.023698, "mode": "train", "time_backward": 1.054509, "time_data": 0.016973, "time_diff": 1.476825, "time_forward": 0.398509, "time_loss": 0.000359}
[03/28 09:57:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2230", "eta": "3:12:42", "loss": 0.087091, "lr": 0.023714, "mode": "train", "time_backward": 6.647301, "time_data": 0.021230, "time_diff": 7.111911, "time_forward": 0.434599, "time_loss": 0.000372}
[03/28 09:58:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2240", "eta": "3:12:12", "loss": 0.081059, "lr": 0.023730, "mode": "train", "time_backward": 1.059192, "time_data": 0.018984, "time_diff": 1.491479, "time_forward": 0.400432, "time_loss": 0.000260}
[03/28 09:59:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2250", "eta": "3:11:42", "loss": 0.083180, "lr": 0.023747, "mode": "train", "time_backward": 1.068543, "time_data": 0.016759, "time_diff": 1.491451, "time_forward": 0.398017, "time_loss": 0.000236}
[03/28 09:59:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2260", "eta": "3:11:13", "loss": 0.084181, "lr": 0.023763, "mode": "train", "time_backward": 1.128596, "time_data": 0.022426, "time_diff": 1.565351, "time_forward": 0.406473, "time_loss": 0.000242}
[03/28 09:59:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2270", "eta": "3:10:43", "loss": 0.073440, "lr": 0.023779, "mode": "train", "time_backward": 1.053337, "time_data": 0.018227, "time_diff": 1.558799, "time_forward": 0.472303, "time_loss": 0.000230}
[03/28 09:59:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2280", "eta": "3:10:13", "loss": 0.088431, "lr": 0.023796, "mode": "train", "time_backward": 1.058716, "time_data": 0.017548, "time_diff": 1.518329, "time_forward": 0.415497, "time_loss": 0.000461}
[03/28 10:00:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2290", "eta": "3:09:43", "loss": 0.078120, "lr": 0.023812, "mode": "train", "time_backward": 1.093606, "time_data": 0.018237, "time_diff": 1.513470, "time_forward": 0.397921, "time_loss": 0.000238}
[03/28 10:00:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2300", "eta": "3:09:13", "loss": 0.096103, "lr": 0.023828, "mode": "train", "time_backward": 1.111671, "time_data": 0.017421, "time_diff": 1.535692, "time_forward": 0.398845, "time_loss": 0.000312}
[03/28 10:00:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2310", "eta": "3:08:43", "loss": 0.087350, "lr": 0.023845, "mode": "train", "time_backward": 1.086483, "time_data": 0.017890, "time_diff": 1.505930, "time_forward": 0.398189, "time_loss": 0.000301}
[03/28 10:00:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2320", "eta": "3:08:13", "loss": 0.088846, "lr": 0.023861, "mode": "train", "time_backward": 1.130518, "time_data": 0.018826, "time_diff": 1.551185, "time_forward": 0.398321, "time_loss": 0.000266}
[03/28 10:01:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2330", "eta": "3:07:43", "loss": 0.085651, "lr": 0.023877, "mode": "train", "time_backward": 1.072728, "time_data": 0.018585, "time_diff": 1.504866, "time_forward": 0.401496, "time_loss": 0.000347}
[03/28 10:01:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2340", "eta": "3:07:13", "loss": 0.083185, "lr": 0.023894, "mode": "train", "time_backward": 1.054610, "time_data": 0.017086, "time_diff": 1.479148, "time_forward": 0.399345, "time_loss": 0.000312}
[03/28 10:01:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2350", "eta": "3:04:57", "loss": 0.072532, "lr": 0.023910, "mode": "train", "time_backward": 1.093833, "time_data": 0.020896, "time_diff": 1.516205, "time_forward": 0.400718, "time_loss": 0.000431}
[03/28 10:02:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2360", "eta": "3:04:27", "loss": 0.085997, "lr": 0.023926, "mode": "train", "time_backward": 1.069485, "time_data": 0.017142, "time_diff": 1.511078, "time_forward": 0.401423, "time_loss": 0.000384}
[03/28 10:02:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2370", "eta": "3:03:57", "loss": 0.078423, "lr": 0.023943, "mode": "train", "time_backward": 1.108293, "time_data": 0.017887, "time_diff": 1.545374, "time_forward": 0.400082, "time_loss": 0.000343}
[03/28 10:02:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2380", "eta": "3:03:27", "loss": 0.081114, "lr": 0.023959, "mode": "train", "time_backward": 1.059590, "time_data": 0.020036, "time_diff": 1.488613, "time_forward": 0.401449, "time_loss": 0.000384}
[03/28 10:02:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2390", "eta": "3:02:57", "loss": 0.083041, "lr": 0.023975, "mode": "train", "time_backward": 1.053997, "time_data": 0.016815, "time_diff": 1.481982, "time_forward": 0.399607, "time_loss": 0.000653}
[03/28 10:03:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2400", "eta": "3:02:28", "loss": 0.083253, "lr": 0.023992, "mode": "train", "time_backward": 1.135479, "time_data": 0.020154, "time_diff": 1.568443, "time_forward": 0.400298, "time_loss": 0.000362}
[03/28 10:03:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2410", "eta": "3:01:58", "loss": 0.073015, "lr": 0.024008, "mode": "train", "time_backward": 1.055987, "time_data": 0.017901, "time_diff": 1.485673, "time_forward": 0.400118, "time_loss": 0.000404}
[03/28 10:03:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2420", "eta": "3:01:29", "loss": 0.078489, "lr": 0.024024, "mode": "train", "time_backward": 1.070841, "time_data": 0.017043, "time_diff": 1.514095, "time_forward": 0.419014, "time_loss": 0.000287}
[03/28 10:04:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2430", "eta": "3:00:18", "loss": 0.085536, "lr": 0.024041, "mode": "train", "time_backward": 1.079092, "time_data": 0.016751, "time_diff": 1.519815, "time_forward": 0.409297, "time_loss": 0.000218}
[03/28 10:04:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2440", "eta": "2:59:48", "loss": 0.083399, "lr": 0.024057, "mode": "train", "time_backward": 1.056498, "time_data": 0.017093, "time_diff": 1.481250, "time_forward": 0.398605, "time_loss": 0.000427}
[03/28 10:05:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2450", "eta": "2:59:17", "loss": 0.090748, "lr": 0.024073, "mode": "train", "time_backward": 1.078283, "time_data": 0.016737, "time_diff": 1.513689, "time_forward": 0.400511, "time_loss": 0.000264}
[03/28 10:05:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2460", "eta": "2:58:47", "loss": 0.079638, "lr": 0.024089, "mode": "train", "time_backward": 1.063166, "time_data": 0.018402, "time_diff": 1.489974, "time_forward": 0.400624, "time_loss": 0.000355}
[03/28 10:05:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2470", "eta": "2:58:17", "loss": 0.077440, "lr": 0.024106, "mode": "train", "time_backward": 1.056583, "time_data": 0.018009, "time_diff": 1.480909, "time_forward": 0.399244, "time_loss": 0.000271}
[03/28 10:06:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2480", "eta": "2:57:49", "loss": 0.089637, "lr": 0.024122, "mode": "train", "time_backward": 1.140068, "time_data": 0.017128, "time_diff": 1.585977, "time_forward": 0.399861, "time_loss": 0.000357}
[03/28 10:06:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2490", "eta": "2:57:20", "loss": 0.080167, "lr": 0.024138, "mode": "train", "time_backward": 1.137140, "time_data": 0.021940, "time_diff": 1.565296, "time_forward": 0.402536, "time_loss": 0.000240}
[03/28 10:06:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2500", "eta": "2:56:50", "loss": 0.096026, "lr": 0.024155, "mode": "train", "time_backward": 1.060832, "time_data": 0.018107, "time_diff": 1.490402, "time_forward": 0.403448, "time_loss": 0.000307}
[03/28 10:07:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2510", "eta": "2:56:01", "loss": 0.086927, "lr": 0.024171, "mode": "train", "time_backward": 1.058065, "time_data": 0.020877, "time_diff": 1.514335, "time_forward": 0.403495, "time_loss": 0.000398}
[03/28 10:07:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2520", "eta": "2:55:32", "loss": 0.084403, "lr": 0.024187, "mode": "train", "time_backward": 1.102958, "time_data": 0.027941, "time_diff": 1.541106, "time_forward": 0.399568, "time_loss": 0.000272}
[03/28 10:08:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2530", "eta": "2:55:03", "loss": 0.086087, "lr": 0.024204, "mode": "train", "time_backward": 1.131076, "time_data": 0.017286, "time_diff": 1.550130, "time_forward": 0.398255, "time_loss": 0.000242}
[03/28 10:08:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2540", "eta": "2:54:34", "loss": 0.081938, "lr": 0.024220, "mode": "train", "time_backward": 1.068419, "time_data": 0.017057, "time_diff": 1.483687, "time_forward": 0.397653, "time_loss": 0.000241}
[03/28 10:08:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2550", "eta": "2:54:04", "loss": 0.085211, "lr": 0.024236, "mode": "train", "time_backward": 1.053047, "time_data": 0.018697, "time_diff": 1.557214, "time_forward": 0.462956, "time_loss": 0.000368}
[03/28 10:08:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2560", "eta": "2:53:34", "loss": 0.076338, "lr": 0.024253, "mode": "train", "time_backward": 1.100625, "time_data": 0.017582, "time_diff": 1.534026, "time_forward": 0.404281, "time_loss": 0.000245}
[03/28 10:09:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2570", "eta": "2:53:04", "loss": 0.072429, "lr": 0.024269, "mode": "train", "time_backward": 1.108035, "time_data": 0.017803, "time_diff": 1.529558, "time_forward": 0.400046, "time_loss": 0.000319}
[03/28 10:09:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2580", "eta": "2:52:35", "loss": 0.090547, "lr": 0.024285, "mode": "train", "time_backward": 1.075547, "time_data": 0.017220, "time_diff": 1.517833, "time_forward": 0.404353, "time_loss": 0.000410}
[03/28 10:09:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2590", "eta": "2:52:09", "loss": 0.088699, "lr": 0.024302, "mode": "train", "time_backward": 1.130365, "time_data": 0.042184, "time_diff": 1.924952, "time_forward": 0.732849, "time_loss": 0.016193}
[03/28 10:10:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2600", "eta": "2:51:39", "loss": 0.083962, "lr": 0.024318, "mode": "train", "time_backward": 1.068596, "time_data": 0.017730, "time_diff": 1.501503, "time_forward": 0.403017, "time_loss": 0.000276}
[03/28 10:10:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2610", "eta": "2:51:10", "loss": 0.084530, "lr": 0.024334, "mode": "train", "time_backward": 1.074558, "time_data": 0.091264, "time_diff": 1.583127, "time_forward": 0.415907, "time_loss": 0.000219}
[03/28 10:10:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2620", "eta": "2:50:40", "loss": 0.085290, "lr": 0.024351, "mode": "train", "time_backward": 1.053920, "time_data": 0.019391, "time_diff": 1.478755, "time_forward": 0.398109, "time_loss": 0.000306}
[03/28 10:10:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2630", "eta": "2:50:11", "loss": 0.077034, "lr": 0.024367, "mode": "train", "time_backward": 1.079107, "time_data": 0.017867, "time_diff": 1.546327, "time_forward": 0.442167, "time_loss": 0.000257}
[03/28 10:11:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2640", "eta": "2:49:41", "loss": 0.075714, "lr": 0.024383, "mode": "train", "time_backward": 1.095123, "time_data": 0.017519, "time_diff": 1.515651, "time_forward": 0.400061, "time_loss": 0.000244}
[03/28 10:11:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2650", "eta": "2:49:11", "loss": 0.071535, "lr": 0.024400, "mode": "train", "time_backward": 1.066721, "time_data": 0.035156, "time_diff": 1.512242, "time_forward": 0.398875, "time_loss": 0.000277}
[03/28 10:11:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2660", "eta": "2:48:42", "loss": 0.085656, "lr": 0.024416, "mode": "train", "time_backward": 1.087572, "time_data": 0.036876, "time_diff": 1.580756, "time_forward": 0.399487, "time_loss": 0.000239}
[03/28 10:12:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2670", "eta": "2:48:12", "loss": 0.082853, "lr": 0.024432, "mode": "train", "time_backward": 1.141109, "time_data": 0.017921, "time_diff": 1.597668, "time_forward": 0.414430, "time_loss": 0.016933}
[03/28 10:12:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2680", "eta": "2:47:42", "loss": 0.084932, "lr": 0.024449, "mode": "train", "time_backward": 1.072241, "time_data": 0.017796, "time_diff": 1.523789, "time_forward": 0.400010, "time_loss": 0.000347}
[03/28 10:12:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2690", "eta": "2:47:14", "loss": 0.075268, "lr": 0.024465, "mode": "train", "time_backward": 1.153275, "time_data": 0.022397, "time_diff": 1.587328, "time_forward": 0.408517, "time_loss": 0.000247}
[03/28 10:12:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2700", "eta": "2:46:44", "loss": 0.081677, "lr": 0.024481, "mode": "train", "time_backward": 1.087577, "time_data": 0.020561, "time_diff": 1.560694, "time_forward": 0.430165, "time_loss": 0.000317}
[03/28 10:13:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2710", "eta": "2:46:15", "loss": 0.092920, "lr": 0.024498, "mode": "train", "time_backward": 1.057474, "time_data": 0.059619, "time_diff": 1.593041, "time_forward": 0.468044, "time_loss": 0.000250}
[03/28 10:13:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2720", "eta": "2:45:46", "loss": 0.092571, "lr": 0.024514, "mode": "train", "time_backward": 1.066579, "time_data": 0.017441, "time_diff": 1.544319, "time_forward": 0.398843, "time_loss": 0.000338}
[03/28 10:13:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2730", "eta": "2:45:16", "loss": 0.080414, "lr": 0.024530, "mode": "train", "time_backward": 1.054271, "time_data": 0.016926, "time_diff": 1.481743, "time_forward": 0.398992, "time_loss": 0.000333}
[03/28 10:14:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2740", "eta": "2:44:46", "loss": 0.089505, "lr": 0.024547, "mode": "train", "time_backward": 1.080632, "time_data": 0.068707, "time_diff": 1.615145, "time_forward": 0.434008, "time_loss": 0.000414}
[03/28 10:14:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2750", "eta": "2:44:19", "loss": 0.081493, "lr": 0.024563, "mode": "train", "time_backward": 1.096291, "time_data": 0.017326, "time_diff": 1.775657, "time_forward": 0.449019, "time_loss": 0.000613}
[03/28 10:14:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2760", "eta": "2:43:50", "loss": 0.086568, "lr": 0.024579, "mode": "train", "time_backward": 1.085509, "time_data": 0.023444, "time_diff": 1.531068, "time_forward": 0.407974, "time_loss": 0.000801}
[03/28 10:14:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2770", "eta": "2:43:19", "loss": 0.078475, "lr": 0.024596, "mode": "train", "time_backward": 1.057428, "time_data": 0.018128, "time_diff": 1.487577, "time_forward": 0.408578, "time_loss": 0.000266}
[03/28 10:15:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2780", "eta": "2:42:50", "loss": 0.082490, "lr": 0.024612, "mode": "train", "time_backward": 1.069550, "time_data": 0.020216, "time_diff": 1.508220, "time_forward": 0.414969, "time_loss": 0.000367}
[03/28 10:15:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2790", "eta": "2:42:20", "loss": 0.085530, "lr": 0.024628, "mode": "train", "time_backward": 1.069396, "time_data": 0.017952, "time_diff": 1.507254, "time_forward": 0.400669, "time_loss": 0.000404}
[03/28 10:15:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2800", "eta": "2:41:51", "loss": 0.082571, "lr": 0.024645, "mode": "train", "time_backward": 1.101670, "time_data": 0.017483, "time_diff": 1.527363, "time_forward": 0.407374, "time_loss": 0.000406}
[03/28 10:16:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2810", "eta": "2:42:35", "loss": 0.088915, "lr": 0.024661, "mode": "train", "time_backward": 12.216149, "time_data": 0.020779, "time_diff": 12.649415, "time_forward": 0.407716, "time_loss": 0.000293}
[03/28 10:16:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2820", "eta": "2:42:06", "loss": 0.083580, "lr": 0.024677, "mode": "train", "time_backward": 1.081770, "time_data": 0.017427, "time_diff": 1.501846, "time_forward": 0.398967, "time_loss": 0.000352}
[03/28 10:16:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2830", "eta": "2:41:36", "loss": 0.087884, "lr": 0.024694, "mode": "train", "time_backward": 1.075954, "time_data": 0.017431, "time_diff": 1.521072, "time_forward": 0.399829, "time_loss": 0.000258}
[03/28 10:17:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2840", "eta": "2:41:07", "loss": 0.077103, "lr": 0.024710, "mode": "train", "time_backward": 1.072275, "time_data": 0.017151, "time_diff": 1.502477, "time_forward": 0.398350, "time_loss": 0.000213}
[03/28 10:17:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2850", "eta": "2:40:38", "loss": 0.093079, "lr": 0.024726, "mode": "train", "time_backward": 1.086888, "time_data": 0.017457, "time_diff": 1.595891, "time_forward": 0.482012, "time_loss": 0.000313}
[03/28 10:18:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2860", "eta": "2:40:08", "loss": 0.087555, "lr": 0.024743, "mode": "train", "time_backward": 1.060585, "time_data": 0.017107, "time_diff": 1.483378, "time_forward": 0.398989, "time_loss": 0.000368}
[03/28 10:19:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2870", "eta": "2:39:39", "loss": 0.080119, "lr": 0.024759, "mode": "train", "time_backward": 1.056535, "time_data": 0.022989, "time_diff": 1.584208, "time_forward": 0.500229, "time_loss": 0.000277}
[03/28 10:20:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2880", "eta": "2:39:09", "loss": 0.085010, "lr": 0.024775, "mode": "train", "time_backward": 1.068392, "time_data": 0.019457, "time_diff": 1.514633, "time_forward": 0.423365, "time_loss": 0.000550}
[03/28 10:20:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2890", "eta": "2:38:39", "loss": 0.079971, "lr": 0.024791, "mode": "train", "time_backward": 1.095819, "time_data": 0.021122, "time_diff": 1.542431, "time_forward": 0.421684, "time_loss": 0.000254}
[03/28 10:20:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2900", "eta": "2:38:10", "loss": 0.083701, "lr": 0.024808, "mode": "train", "time_backward": 1.056601, "time_data": 0.023682, "time_diff": 1.498011, "time_forward": 0.409857, "time_loss": 0.000348}
[03/28 10:20:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2910", "eta": "2:37:41", "loss": 0.081589, "lr": 0.024824, "mode": "train", "time_backward": 1.129150, "time_data": 0.017733, "time_diff": 1.568481, "time_forward": 0.412105, "time_loss": 0.000234}
[03/28 10:21:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2920", "eta": "2:37:11", "loss": 0.075716, "lr": 0.024840, "mode": "train", "time_backward": 1.153286, "time_data": 0.017648, "time_diff": 1.573619, "time_forward": 0.399344, "time_loss": 0.000276}
[03/28 10:21:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2930", "eta": "2:36:42", "loss": 0.082186, "lr": 0.024857, "mode": "train", "time_backward": 1.067606, "time_data": 0.019724, "time_diff": 1.497204, "time_forward": 0.406308, "time_loss": 0.000261}
[03/28 10:21:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2940", "eta": "2:36:12", "loss": 0.085729, "lr": 0.024873, "mode": "train", "time_backward": 1.065180, "time_data": 0.017347, "time_diff": 1.490878, "time_forward": 0.401090, "time_loss": 0.000267}
[03/28 10:21:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2950", "eta": "2:35:43", "loss": 0.082461, "lr": 0.024889, "mode": "train", "time_backward": 1.165342, "time_data": 0.016963, "time_diff": 1.696030, "time_forward": 0.414213, "time_loss": 0.000275}
[03/28 10:22:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2960", "eta": "2:35:13", "loss": 0.083896, "lr": 0.024906, "mode": "train", "time_backward": 1.124276, "time_data": 0.016884, "time_diff": 1.554260, "time_forward": 0.400301, "time_loss": 0.000347}
[03/28 10:22:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2970", "eta": "2:34:44", "loss": 0.083113, "lr": 0.024922, "mode": "train", "time_backward": 1.078650, "time_data": 0.024413, "time_diff": 1.558678, "time_forward": 0.446519, "time_loss": 0.000254}
[03/28 10:22:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2980", "eta": "2:34:14", "loss": 0.091070, "lr": 0.024938, "mode": "train", "time_backward": 1.104470, "time_data": 0.017183, "time_diff": 1.531860, "time_forward": 0.399469, "time_loss": 0.000357}
[03/28 10:23:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "2990", "eta": "2:33:44", "loss": 0.075818, "lr": 0.024955, "mode": "train", "time_backward": 1.089745, "time_data": 0.018428, "time_diff": 1.513535, "time_forward": 0.399733, "time_loss": 0.000335}
[03/28 10:23:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3000", "eta": "2:33:14", "loss": 0.087458, "lr": 0.024971, "mode": "train", "time_backward": 1.056109, "time_data": 0.016904, "time_diff": 1.484356, "time_forward": 0.399022, "time_loss": 0.000262}
[03/28 10:23:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3010", "eta": "2:32:44", "loss": 0.084074, "lr": 0.024987, "mode": "train", "time_backward": 1.055028, "time_data": 0.016969, "time_diff": 1.477947, "time_forward": 0.399944, "time_loss": 0.000339}
[03/28 10:24:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3020", "eta": "2:32:14", "loss": 0.086778, "lr": 0.025004, "mode": "train", "time_backward": 1.070446, "time_data": 0.017802, "time_diff": 1.505890, "time_forward": 0.414128, "time_loss": 0.000242}
[03/28 10:24:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3030", "eta": "2:31:45", "loss": 0.079212, "lr": 0.025020, "mode": "train", "time_backward": 1.063204, "time_data": 0.017224, "time_diff": 1.529586, "time_forward": 0.400178, "time_loss": 0.000354}
[03/28 10:24:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3040", "eta": "2:31:15", "loss": 0.082716, "lr": 0.025036, "mode": "train", "time_backward": 1.063302, "time_data": 0.017828, "time_diff": 1.484431, "time_forward": 0.399498, "time_loss": 0.000385}
[03/28 10:25:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3050", "eta": "2:30:45", "loss": 0.079539, "lr": 0.025053, "mode": "train", "time_backward": 1.093240, "time_data": 0.021254, "time_diff": 1.544863, "time_forward": 0.429633, "time_loss": 0.000428}
[03/28 10:25:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3060", "eta": "2:30:16", "loss": 0.081481, "lr": 0.025069, "mode": "train", "time_backward": 1.079032, "time_data": 0.034256, "time_diff": 1.607421, "time_forward": 0.470312, "time_loss": 0.000259}
[03/28 10:25:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3070", "eta": "2:28:16", "loss": 0.082871, "lr": 0.025085, "mode": "train", "time_backward": 1.059781, "time_data": 0.017700, "time_diff": 1.479606, "time_forward": 0.398792, "time_loss": 0.000266}
[03/28 10:26:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3080", "eta": "2:27:47", "loss": 0.090140, "lr": 0.025102, "mode": "train", "time_backward": 1.140525, "time_data": 0.017662, "time_diff": 1.560030, "time_forward": 0.398262, "time_loss": 0.000243}
[03/28 10:26:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3090", "eta": "2:27:17", "loss": 0.082619, "lr": 0.025118, "mode": "train", "time_backward": 1.065055, "time_data": 0.017512, "time_diff": 1.506409, "time_forward": 0.420094, "time_loss": 0.000395}
[03/28 10:27:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3100", "eta": "2:26:48", "loss": 0.082891, "lr": 0.025134, "mode": "train", "time_backward": 1.061504, "time_data": 0.044651, "time_diff": 1.512503, "time_forward": 0.402605, "time_loss": 0.000358}
[03/28 10:27:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3110", "eta": "2:26:19", "loss": 0.082574, "lr": 0.025151, "mode": "train", "time_backward": 1.095201, "time_data": 0.019749, "time_diff": 1.536120, "time_forward": 0.415976, "time_loss": 0.000785}
[03/28 10:27:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3120", "eta": "2:25:49", "loss": 0.076744, "lr": 0.025167, "mode": "train", "time_backward": 1.113592, "time_data": 0.019160, "time_diff": 1.545402, "time_forward": 0.403012, "time_loss": 0.000258}
[03/28 10:28:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3130", "eta": "2:25:42", "loss": 0.078531, "lr": 0.025183, "mode": "train", "time_backward": 1.674126, "time_data": 0.529675, "time_diff": 5.255992, "time_forward": 2.951187, "time_loss": 0.078062}
[03/28 10:28:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3140", "eta": "2:25:13", "loss": 0.084005, "lr": 0.025200, "mode": "train", "time_backward": 1.052207, "time_data": 0.016814, "time_diff": 1.477109, "time_forward": 0.398010, "time_loss": 0.000333}
[03/28 10:28:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3150", "eta": "2:24:44", "loss": 0.083540, "lr": 0.025216, "mode": "train", "time_backward": 1.146631, "time_data": 0.018323, "time_diff": 1.598430, "time_forward": 0.405578, "time_loss": 0.021204}
[03/28 10:29:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3160", "eta": "2:24:15", "loss": 0.072826, "lr": 0.025232, "mode": "train", "time_backward": 1.084660, "time_data": 0.018450, "time_diff": 1.802845, "time_forward": 0.662460, "time_loss": 0.034817}
[03/28 10:29:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3170", "eta": "2:23:45", "loss": 0.079217, "lr": 0.025249, "mode": "train", "time_backward": 1.064423, "time_data": 0.019495, "time_diff": 1.517335, "time_forward": 0.399712, "time_loss": 0.003008}
[03/28 10:29:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3180", "eta": "2:23:16", "loss": 0.083198, "lr": 0.025265, "mode": "train", "time_backward": 1.178331, "time_data": 0.020705, "time_diff": 1.611375, "time_forward": 0.409053, "time_loss": 0.000348}
[03/28 10:29:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3190", "eta": "2:22:46", "loss": 0.087554, "lr": 0.025281, "mode": "train", "time_backward": 1.054567, "time_data": 0.036808, "time_diff": 1.501201, "time_forward": 0.405741, "time_loss": 0.000244}
[03/28 10:30:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3200", "eta": "2:22:24", "loss": 0.073627, "lr": 0.025298, "mode": "train", "time_backward": 1.948824, "time_data": 0.017329, "time_diff": 2.816516, "time_forward": 0.835272, "time_loss": 0.001032}
[03/28 10:30:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3210", "eta": "2:21:58", "loss": 0.084629, "lr": 0.025314, "mode": "train", "time_backward": 1.127849, "time_data": 0.023642, "time_diff": 1.998914, "time_forward": 0.759619, "time_loss": 0.000413}
[03/28 10:30:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3220", "eta": "2:21:22", "loss": 0.077828, "lr": 0.025330, "mode": "train", "time_backward": 1.067961, "time_data": 0.017276, "time_diff": 1.498320, "time_forward": 0.398084, "time_loss": 0.000226}
[03/28 10:31:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3230", "eta": "2:20:53", "loss": 0.086775, "lr": 0.025347, "mode": "train", "time_backward": 1.063570, "time_data": 0.018050, "time_diff": 1.539616, "time_forward": 0.446545, "time_loss": 0.000223}
[03/28 10:31:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3240", "eta": "2:20:24", "loss": 0.076444, "lr": 0.025363, "mode": "train", "time_backward": 1.067109, "time_data": 0.017026, "time_diff": 1.498508, "time_forward": 0.399928, "time_loss": 0.000290}
[03/28 10:31:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3250", "eta": "2:17:12", "loss": 0.086017, "lr": 0.025379, "mode": "train", "time_backward": 1.086716, "time_data": 0.017643, "time_diff": 1.523650, "time_forward": 0.400872, "time_loss": 0.000346}
[03/28 10:31:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3260", "eta": "2:16:43", "loss": 0.083018, "lr": 0.025396, "mode": "train", "time_backward": 1.113493, "time_data": 0.025595, "time_diff": 1.581828, "time_forward": 0.415373, "time_loss": 0.000261}
[03/28 10:32:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3270", "eta": "2:16:14", "loss": 0.079149, "lr": 0.025412, "mode": "train", "time_backward": 1.090150, "time_data": 0.016627, "time_diff": 1.513375, "time_forward": 0.398600, "time_loss": 0.000361}
[03/28 10:32:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3280", "eta": "2:15:44", "loss": 0.081128, "lr": 0.025428, "mode": "train", "time_backward": 1.056999, "time_data": 0.019729, "time_diff": 1.482101, "time_forward": 0.398159, "time_loss": 0.000252}
[03/28 10:32:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3290", "eta": "2:15:15", "loss": 0.082713, "lr": 0.025445, "mode": "train", "time_backward": 1.054092, "time_data": 0.017696, "time_diff": 1.585696, "time_forward": 0.510084, "time_loss": 0.000309}
[03/28 10:33:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3300", "eta": "2:14:46", "loss": 0.071515, "lr": 0.025461, "mode": "train", "time_backward": 1.119921, "time_data": 0.018203, "time_diff": 1.561188, "time_forward": 0.418679, "time_loss": 0.000274}
[03/28 10:33:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3310", "eta": "2:14:18", "loss": 0.089098, "lr": 0.025477, "mode": "train", "time_backward": 1.076785, "time_data": 0.017299, "time_diff": 1.555240, "time_forward": 0.404411, "time_loss": 0.000300}
[03/28 10:33:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3320", "eta": "2:13:49", "loss": 0.075391, "lr": 0.025493, "mode": "train", "time_backward": 1.053771, "time_data": 0.017343, "time_diff": 1.529153, "time_forward": 0.446018, "time_loss": 0.000236}
[03/28 10:33:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3330", "eta": "2:13:21", "loss": 0.079244, "lr": 0.025510, "mode": "train", "time_backward": 1.161880, "time_data": 0.019269, "time_diff": 1.588933, "time_forward": 0.405035, "time_loss": 0.000272}
[03/28 10:34:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3340", "eta": "2:12:53", "loss": 0.084977, "lr": 0.025526, "mode": "train", "time_backward": 1.173452, "time_data": 0.020392, "time_diff": 1.678482, "time_forward": 0.459401, "time_loss": 0.002450}
[03/28 10:34:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3350", "eta": "2:12:25", "loss": 0.082671, "lr": 0.025542, "mode": "train", "time_backward": 1.182444, "time_data": 0.017389, "time_diff": 1.631707, "time_forward": 0.403443, "time_loss": 0.000579}
[03/28 10:34:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3360", "eta": "2:11:56", "loss": 0.082971, "lr": 0.025559, "mode": "train", "time_backward": 1.098164, "time_data": 0.017341, "time_diff": 1.521392, "time_forward": 0.399633, "time_loss": 0.000421}
[03/28 10:35:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3370", "eta": "2:11:29", "loss": 0.073646, "lr": 0.025575, "mode": "train", "time_backward": 1.163008, "time_data": 0.019508, "time_diff": 1.901937, "time_forward": 0.715632, "time_loss": 0.000509}
[03/28 10:35:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3380", "eta": "2:11:01", "loss": 0.081447, "lr": 0.025591, "mode": "train", "time_backward": 1.122659, "time_data": 0.017718, "time_diff": 1.570238, "time_forward": 0.421172, "time_loss": 0.000389}
[03/28 10:35:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3390", "eta": "2:10:32", "loss": 0.083926, "lr": 0.025608, "mode": "train", "time_backward": 1.113312, "time_data": 0.025123, "time_diff": 1.559271, "time_forward": 0.412849, "time_loss": 0.000308}
[03/28 10:36:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3400", "eta": "2:09:46", "loss": 0.079903, "lr": 0.025624, "mode": "train", "time_backward": 1.205234, "time_data": 0.017209, "time_diff": 1.756821, "time_forward": 0.398144, "time_loss": 0.000305}
[03/28 10:36:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3410", "eta": "2:09:17", "loss": 0.075545, "lr": 0.025640, "mode": "train", "time_backward": 1.081459, "time_data": 0.016665, "time_diff": 1.504878, "time_forward": 0.405994, "time_loss": 0.000367}
[03/28 10:36:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3420", "eta": "2:08:49", "loss": 0.078646, "lr": 0.025657, "mode": "train", "time_backward": 1.074556, "time_data": 0.035154, "time_diff": 1.623959, "time_forward": 0.448677, "time_loss": 0.000338}
[03/28 10:36:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3430", "eta": "2:08:20", "loss": 0.077627, "lr": 0.025673, "mode": "train", "time_backward": 1.102924, "time_data": 0.019074, "time_diff": 1.549349, "time_forward": 0.407819, "time_loss": 0.000345}
[03/28 10:37:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3440", "eta": "2:07:52", "loss": 0.079744, "lr": 0.025689, "mode": "train", "time_backward": 1.090229, "time_data": 0.025087, "time_diff": 1.554075, "time_forward": 0.400914, "time_loss": 0.000424}
[03/28 10:37:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3450", "eta": "2:07:13", "loss": 0.077799, "lr": 0.025706, "mode": "train", "time_backward": 1.087512, "time_data": 0.016855, "time_diff": 1.520209, "time_forward": 0.402818, "time_loss": 0.000257}
[03/28 10:37:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3460", "eta": "2:06:44", "loss": 0.088209, "lr": 0.025722, "mode": "train", "time_backward": 1.061458, "time_data": 0.017678, "time_diff": 1.485588, "time_forward": 0.402817, "time_loss": 0.000376}
[03/28 10:39:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3470", "eta": "2:06:15", "loss": 0.079621, "lr": 0.025738, "mode": "train", "time_backward": 1.066327, "time_data": 0.018604, "time_diff": 1.487047, "time_forward": 0.399416, "time_loss": 0.000416}
[03/28 10:39:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3480", "eta": "2:05:47", "loss": 0.071846, "lr": 0.025755, "mode": "train", "time_backward": 1.202466, "time_data": 0.017563, "time_diff": 1.652704, "time_forward": 0.423752, "time_loss": 0.000274}
[03/28 10:39:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3490", "eta": "2:05:18", "loss": 0.081049, "lr": 0.025771, "mode": "train", "time_backward": 1.142485, "time_data": 0.019001, "time_diff": 1.587864, "time_forward": 0.420172, "time_loss": 0.000289}
[03/28 10:39:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3500", "eta": "2:04:50", "loss": 0.082811, "lr": 0.025787, "mode": "train", "time_backward": 1.082239, "time_data": 0.057655, "time_diff": 1.561078, "time_forward": 0.418010, "time_loss": 0.000296}
[03/28 10:40:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3510", "eta": "2:04:22", "loss": 0.092268, "lr": 0.025804, "mode": "train", "time_backward": 1.082424, "time_data": 0.022738, "time_diff": 1.604313, "time_forward": 0.433209, "time_loss": 0.000423}
[03/28 10:40:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3520", "eta": "2:03:53", "loss": 0.080786, "lr": 0.025820, "mode": "train", "time_backward": 1.116017, "time_data": 0.017242, "time_diff": 1.571493, "time_forward": 0.403131, "time_loss": 0.000260}
[03/28 10:40:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3530", "eta": "2:02:34", "loss": 0.079190, "lr": 0.025836, "mode": "train", "time_backward": 1.065072, "time_data": 0.031170, "time_diff": 1.543510, "time_forward": 0.398548, "time_loss": 0.000229}
[03/28 10:41:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3540", "eta": "2:02:05", "loss": 0.080923, "lr": 0.025853, "mode": "train", "time_backward": 1.061641, "time_data": 0.021573, "time_diff": 1.516197, "time_forward": 0.421308, "time_loss": 0.000230}
[03/28 10:41:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3550", "eta": "2:01:38", "loss": 0.087625, "lr": 0.025869, "mode": "train", "time_backward": 1.143527, "time_data": 0.018751, "time_diff": 1.638213, "time_forward": 0.473139, "time_loss": 0.000358}
[03/28 10:41:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3560", "eta": "2:01:09", "loss": 0.084988, "lr": 0.025885, "mode": "train", "time_backward": 1.063416, "time_data": 0.017584, "time_diff": 1.492994, "time_forward": 0.400058, "time_loss": 0.000276}
[03/28 10:42:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3570", "eta": "2:00:41", "loss": 0.087975, "lr": 0.025902, "mode": "train", "time_backward": 1.115578, "time_data": 0.018062, "time_diff": 1.605463, "time_forward": 0.407308, "time_loss": 0.000284}
[03/28 10:42:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3580", "eta": "2:00:12", "loss": 0.076565, "lr": 0.025918, "mode": "train", "time_backward": 1.066994, "time_data": 0.017414, "time_diff": 1.537465, "time_forward": 0.438656, "time_loss": 0.000351}
[03/28 10:42:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3590", "eta": "1:59:36", "loss": 0.077268, "lr": 0.025934, "mode": "train", "time_backward": 1.106624, "time_data": 0.017381, "time_diff": 1.539920, "time_forward": 0.400214, "time_loss": 0.000862}
[03/28 10:42:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3600", "eta": "1:59:08", "loss": 0.072456, "lr": 0.025951, "mode": "train", "time_backward": 1.060582, "time_data": 0.017385, "time_diff": 1.484039, "time_forward": 0.398381, "time_loss": 0.000361}
[03/28 10:43:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3610", "eta": "1:58:40", "loss": 0.083382, "lr": 0.025967, "mode": "train", "time_backward": 1.204818, "time_data": 0.017413, "time_diff": 1.630068, "time_forward": 0.404191, "time_loss": 0.000366}
[03/28 10:43:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3620", "eta": "1:58:12", "loss": 0.075863, "lr": 0.025983, "mode": "train", "time_backward": 1.205423, "time_data": 0.017337, "time_diff": 1.638930, "time_forward": 0.413472, "time_loss": 0.000377}
[03/28 10:43:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3630", "eta": "1:57:44", "loss": 0.082790, "lr": 0.026000, "mode": "train", "time_backward": 1.089859, "time_data": 0.018308, "time_diff": 1.518888, "time_forward": 0.399486, "time_loss": 0.000342}
[03/28 10:43:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3640", "eta": "1:54:57", "loss": 0.084805, "lr": 0.026016, "mode": "train", "time_backward": 1.054934, "time_data": 0.048590, "time_diff": 1.585457, "time_forward": 0.461368, "time_loss": 0.000341}
[03/28 10:44:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3650", "eta": "1:54:29", "loss": 0.076957, "lr": 0.026032, "mode": "train", "time_backward": 1.105255, "time_data": 0.022998, "time_diff": 1.576367, "time_forward": 0.428274, "time_loss": 0.000526}
[03/28 10:44:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3660", "eta": "1:54:03", "loss": 0.073955, "lr": 0.026049, "mode": "train", "time_backward": 1.280270, "time_data": 0.022055, "time_diff": 1.817616, "time_forward": 0.405198, "time_loss": 0.000585}
[03/28 10:44:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3670", "eta": "1:53:35", "loss": 0.081701, "lr": 0.026065, "mode": "train", "time_backward": 1.113838, "time_data": 0.016820, "time_diff": 1.587235, "time_forward": 0.406134, "time_loss": 0.000357}
[03/28 10:44:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3680", "eta": "1:53:08", "loss": 0.083385, "lr": 0.026081, "mode": "train", "time_backward": 1.060349, "time_data": 0.023677, "time_diff": 1.593993, "time_forward": 0.496551, "time_loss": 0.000356}
[03/28 10:45:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3690", "eta": "1:52:41", "loss": 0.082192, "lr": 0.026098, "mode": "train", "time_backward": 1.160706, "time_data": 0.019815, "time_diff": 1.647233, "time_forward": 0.398478, "time_loss": 0.000297}
[03/28 10:45:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3700", "eta": "1:52:12", "loss": 0.080227, "lr": 0.026114, "mode": "train", "time_backward": 1.055553, "time_data": 0.017673, "time_diff": 1.479625, "time_forward": 0.399810, "time_loss": 0.000300}
[03/28 10:45:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3710", "eta": "1:51:44", "loss": 0.079761, "lr": 0.026130, "mode": "train", "time_backward": 1.055837, "time_data": 0.017542, "time_diff": 1.477490, "time_forward": 0.398758, "time_loss": 0.000308}
[03/28 10:46:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3720", "eta": "1:51:17", "loss": 0.068142, "lr": 0.026147, "mode": "train", "time_backward": 1.094069, "time_data": 0.130915, "time_diff": 1.676662, "time_forward": 0.449346, "time_loss": 0.000425}
[03/28 10:46:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3730", "eta": "1:50:49", "loss": 0.075906, "lr": 0.026163, "mode": "train", "time_backward": 1.063059, "time_data": 0.023133, "time_diff": 1.498740, "time_forward": 0.401832, "time_loss": 0.000368}
[03/28 10:47:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3740", "eta": "1:49:09", "loss": 0.082219, "lr": 0.026179, "mode": "train", "time_backward": 1.098390, "time_data": 0.017668, "time_diff": 1.533119, "time_forward": 0.408558, "time_loss": 0.000399}
[03/28 10:47:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3750", "eta": "1:48:41", "loss": 0.081235, "lr": 0.026196, "mode": "train", "time_backward": 1.100791, "time_data": 0.018863, "time_diff": 1.535182, "time_forward": 0.399077, "time_loss": 0.000236}
[03/28 10:47:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3760", "eta": "1:48:13", "loss": 0.075638, "lr": 0.026212, "mode": "train", "time_backward": 1.210008, "time_data": 0.054421, "time_diff": 1.879785, "time_forward": 0.614454, "time_loss": 0.000410}
[03/28 10:48:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3770", "eta": "1:47:46", "loss": 0.087174, "lr": 0.026228, "mode": "train", "time_backward": 1.064826, "time_data": 0.018033, "time_diff": 1.492973, "time_forward": 0.402759, "time_loss": 0.001875}
[03/28 10:48:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3780", "eta": "1:47:18", "loss": 0.074095, "lr": 0.026244, "mode": "train", "time_backward": 1.071319, "time_data": 0.021486, "time_diff": 1.558472, "time_forward": 0.451002, "time_loss": 0.000488}
[03/28 10:48:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3790", "eta": "1:46:50", "loss": 0.072101, "lr": 0.026261, "mode": "train", "time_backward": 1.154798, "time_data": 0.019967, "time_diff": 1.581985, "time_forward": 0.401793, "time_loss": 0.000249}
[03/28 10:49:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3800", "eta": "1:46:22", "loss": 0.077502, "lr": 0.026277, "mode": "train", "time_backward": 1.085597, "time_data": 0.020714, "time_diff": 1.509480, "time_forward": 0.398081, "time_loss": 0.001753}
[03/28 10:49:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3810", "eta": "1:45:55", "loss": 0.081375, "lr": 0.026293, "mode": "train", "time_backward": 1.060892, "time_data": 0.028014, "time_diff": 1.506503, "time_forward": 0.404498, "time_loss": 0.000336}
[03/28 10:49:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3820", "eta": "1:45:10", "loss": 0.084309, "lr": 0.026310, "mode": "train", "time_backward": 1.112534, "time_data": 0.017975, "time_diff": 1.559540, "time_forward": 0.406860, "time_loss": 0.000314}
[03/28 10:49:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3830", "eta": "1:44:44", "loss": 0.081150, "lr": 0.026326, "mode": "train", "time_backward": 1.352644, "time_data": 0.018002, "time_diff": 1.895285, "time_forward": 0.398742, "time_loss": 0.000376}
[03/28 10:50:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3840", "eta": "1:44:16", "loss": 0.076412, "lr": 0.026342, "mode": "train", "time_backward": 1.064862, "time_data": 0.018681, "time_diff": 1.521561, "time_forward": 0.415005, "time_loss": 0.000275}
[03/28 10:50:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3850", "eta": "1:43:48", "loss": 0.084285, "lr": 0.026359, "mode": "train", "time_backward": 1.066726, "time_data": 0.017502, "time_diff": 1.539404, "time_forward": 0.417113, "time_loss": 0.000293}
[03/28 10:50:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3860", "eta": "1:43:20", "loss": 0.074300, "lr": 0.026375, "mode": "train", "time_backward": 1.059589, "time_data": 0.016799, "time_diff": 1.514146, "time_forward": 0.434948, "time_loss": 0.000257}
[03/28 10:50:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3870", "eta": "1:42:53", "loss": 0.075473, "lr": 0.026391, "mode": "train", "time_backward": 1.075410, "time_data": 0.017166, "time_diff": 1.499656, "time_forward": 0.400388, "time_loss": 0.000492}
[03/28 10:51:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3880", "eta": "1:42:26", "loss": 0.072023, "lr": 0.026408, "mode": "train", "time_backward": 1.065141, "time_data": 0.026633, "time_diff": 1.564222, "time_forward": 0.441634, "time_loss": 0.000255}
[03/28 10:51:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3890", "eta": "1:40:33", "loss": 0.080439, "lr": 0.026424, "mode": "train", "time_backward": 1.055438, "time_data": 0.019408, "time_diff": 1.506257, "time_forward": 0.403889, "time_loss": 0.000339}
[03/28 10:51:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3900", "eta": "1:40:05", "loss": 0.082059, "lr": 0.026440, "mode": "train", "time_backward": 1.056397, "time_data": 0.019878, "time_diff": 1.483080, "time_forward": 0.398672, "time_loss": 0.000265}
[03/28 10:52:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3910", "eta": "1:39:13", "loss": 0.084900, "lr": 0.026457, "mode": "train", "time_backward": 1.070813, "time_data": 0.020301, "time_diff": 1.513204, "time_forward": 0.418679, "time_loss": 0.000319}
[03/28 10:52:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3920", "eta": "1:38:46", "loss": 0.087098, "lr": 0.026473, "mode": "train", "time_backward": 1.075759, "time_data": 0.020506, "time_diff": 1.516264, "time_forward": 0.407040, "time_loss": 0.000428}
[03/28 10:52:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3930", "eta": "1:38:19", "loss": 0.079318, "lr": 0.026489, "mode": "train", "time_backward": 1.056728, "time_data": 0.022643, "time_diff": 1.543552, "time_forward": 0.400846, "time_loss": 0.000407}
[03/28 10:52:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3940", "eta": "1:37:52", "loss": 0.075567, "lr": 0.026506, "mode": "train", "time_backward": 1.056600, "time_data": 0.029557, "time_diff": 1.498140, "time_forward": 0.405445, "time_loss": 0.003204}
[03/28 10:53:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3950", "eta": "1:37:25", "loss": 0.079728, "lr": 0.026522, "mode": "train", "time_backward": 1.056459, "time_data": 0.018752, "time_diff": 1.483636, "time_forward": 0.399495, "time_loss": 0.000222}
[03/28 10:53:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3960", "eta": "1:36:58", "loss": 0.083204, "lr": 0.026538, "mode": "train", "time_backward": 1.056995, "time_data": 0.017084, "time_diff": 1.480389, "time_forward": 0.399533, "time_loss": 0.000262}
[03/28 10:53:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3970", "eta": "1:36:31", "loss": 0.086840, "lr": 0.026555, "mode": "train", "time_backward": 1.055170, "time_data": 0.017110, "time_diff": 1.478419, "time_forward": 0.398992, "time_loss": 0.000268}
[03/28 10:54:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3980", "eta": "1:36:04", "loss": 0.073987, "lr": 0.026571, "mode": "train", "time_backward": 1.083327, "time_data": 0.018522, "time_diff": 1.534113, "time_forward": 0.428702, "time_loss": 0.000272}
[03/28 10:55:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "3990", "eta": "1:34:37", "loss": 0.087461, "lr": 0.026587, "mode": "train", "time_backward": 1.089095, "time_data": 0.021241, "time_diff": 1.574412, "time_forward": 0.452195, "time_loss": 0.000265}
[03/28 10:55:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4000", "eta": "1:33:48", "loss": 0.078764, "lr": 0.026604, "mode": "train", "time_backward": 1.078808, "time_data": 0.021315, "time_diff": 1.512728, "time_forward": 0.401053, "time_loss": 0.000504}
[03/28 10:55:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4010", "eta": "1:33:22", "loss": 0.071370, "lr": 0.026620, "mode": "train", "time_backward": 1.174330, "time_data": 0.017242, "time_diff": 1.602700, "time_forward": 0.398373, "time_loss": 0.000259}
[03/28 10:56:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4020", "eta": "1:32:55", "loss": 0.078328, "lr": 0.026636, "mode": "train", "time_backward": 1.059361, "time_data": 0.017219, "time_diff": 1.484113, "time_forward": 0.399902, "time_loss": 0.000254}
[03/28 10:56:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4030", "eta": "1:32:28", "loss": 0.077554, "lr": 0.026653, "mode": "train", "time_backward": 1.054191, "time_data": 0.017346, "time_diff": 1.478825, "time_forward": 0.399040, "time_loss": 0.000279}
[03/28 10:56:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4040", "eta": "1:32:01", "loss": 0.082528, "lr": 0.026669, "mode": "train", "time_backward": 1.056940, "time_data": 0.017503, "time_diff": 1.479521, "time_forward": 0.397230, "time_loss": 0.000267}
[03/28 10:57:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4050", "eta": "1:30:20", "loss": 0.074290, "lr": 0.026685, "mode": "train", "time_backward": 1.055649, "time_data": 0.017244, "time_diff": 1.504341, "time_forward": 0.398633, "time_loss": 0.000286}
[03/28 10:57:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4060", "eta": "1:29:55", "loss": 0.083390, "lr": 0.026702, "mode": "train", "time_backward": 1.140623, "time_data": 0.018897, "time_diff": 1.834761, "time_forward": 0.671245, "time_loss": 0.000375}
[03/28 10:57:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4070", "eta": "1:29:29", "loss": 0.075565, "lr": 0.026718, "mode": "train", "time_backward": 1.056256, "time_data": 0.018257, "time_diff": 1.537144, "time_forward": 0.458989, "time_loss": 0.000427}
[03/28 10:58:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4080", "eta": "1:29:02", "loss": 0.069527, "lr": 0.026734, "mode": "train", "time_backward": 1.058697, "time_data": 0.019839, "time_diff": 1.503735, "time_forward": 0.409358, "time_loss": 0.000326}
[03/28 10:58:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4090", "eta": "1:28:50", "loss": 0.085999, "lr": 0.026751, "mode": "train", "time_backward": 4.441722, "time_data": 0.016964, "time_diff": 5.096997, "time_forward": 0.634909, "time_loss": 0.000366}
[03/28 10:59:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4100", "eta": "1:28:24", "loss": 0.077609, "lr": 0.026767, "mode": "train", "time_backward": 1.071956, "time_data": 0.017143, "time_diff": 1.498565, "time_forward": 0.399727, "time_loss": 0.000411}
[03/28 10:59:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4110", "eta": "1:27:57", "loss": 0.079038, "lr": 0.026783, "mode": "train", "time_backward": 1.054525, "time_data": 0.017618, "time_diff": 1.482625, "time_forward": 0.400097, "time_loss": 0.000256}
[03/28 10:59:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4120", "eta": "1:27:31", "loss": 0.077103, "lr": 0.026800, "mode": "train", "time_backward": 1.086596, "time_data": 0.016886, "time_diff": 1.511208, "time_forward": 0.398351, "time_loss": 0.000239}
[03/28 11:00:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4130", "eta": "1:27:04", "loss": 0.078157, "lr": 0.026816, "mode": "train", "time_backward": 1.075840, "time_data": 0.024458, "time_diff": 1.511596, "time_forward": 0.399871, "time_loss": 0.000266}
[03/28 11:00:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4140", "eta": "1:26:38", "loss": 0.073133, "lr": 0.026832, "mode": "train", "time_backward": 1.052229, "time_data": 0.016898, "time_diff": 1.475056, "time_forward": 0.401112, "time_loss": 0.000244}
[03/28 11:01:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4150", "eta": "1:26:11", "loss": 0.078174, "lr": 0.026849, "mode": "train", "time_backward": 1.055925, "time_data": 0.021153, "time_diff": 1.499907, "time_forward": 0.400130, "time_loss": 0.000299}
[03/28 11:01:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4160", "eta": "1:25:45", "loss": 0.072785, "lr": 0.026865, "mode": "train", "time_backward": 1.061236, "time_data": 0.016802, "time_diff": 1.490377, "time_forward": 0.401607, "time_loss": 0.000230}
[03/28 11:02:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4170", "eta": "1:25:18", "loss": 0.065052, "lr": 0.026881, "mode": "train", "time_backward": 1.055511, "time_data": 0.016973, "time_diff": 1.481460, "time_forward": 0.398499, "time_loss": 0.000263}
[03/28 11:02:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4180", "eta": "1:24:52", "loss": 0.072832, "lr": 0.026898, "mode": "train", "time_backward": 1.067974, "time_data": 0.019916, "time_diff": 1.572563, "time_forward": 0.398673, "time_loss": 0.067986}
[03/28 11:02:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4190", "eta": "1:24:26", "loss": 0.078272, "lr": 0.026914, "mode": "train", "time_backward": 1.133653, "time_data": 0.029859, "time_diff": 1.601772, "time_forward": 0.406261, "time_loss": 0.000477}
[03/28 11:03:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4200", "eta": "1:24:00", "loss": 0.077839, "lr": 0.026930, "mode": "train", "time_backward": 1.058048, "time_data": 0.019688, "time_diff": 1.530855, "time_forward": 0.449897, "time_loss": 0.000250}
[03/28 11:03:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4210", "eta": "1:23:45", "loss": 0.078406, "lr": 0.026946, "mode": "train", "time_backward": 3.904992, "time_data": 0.018443, "time_diff": 4.533838, "time_forward": 0.497782, "time_loss": 0.000260}
[03/28 11:04:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4220", "eta": "1:21:57", "loss": 0.072928, "lr": 0.026963, "mode": "train", "time_backward": 1.058938, "time_data": 0.018418, "time_diff": 1.519465, "time_forward": 0.400569, "time_loss": 0.000247}
[03/28 11:04:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4230", "eta": "1:21:30", "loss": 0.073332, "lr": 0.026979, "mode": "train", "time_backward": 1.055615, "time_data": 0.017009, "time_diff": 1.479176, "time_forward": 0.399549, "time_loss": 0.000250}
[03/28 11:04:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4240", "eta": "1:21:04", "loss": 0.079715, "lr": 0.026995, "mode": "train", "time_backward": 1.067444, "time_data": 0.026212, "time_diff": 1.503638, "time_forward": 0.406399, "time_loss": 0.000265}
[03/28 11:05:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4250", "eta": "1:20:38", "loss": 0.068780, "lr": 0.027012, "mode": "train", "time_backward": 1.056294, "time_data": 0.049338, "time_diff": 1.517327, "time_forward": 0.403969, "time_loss": 0.000789}
[03/28 11:05:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4260", "eta": "1:20:11", "loss": 0.088523, "lr": 0.027028, "mode": "train", "time_backward": 1.054926, "time_data": 0.016944, "time_diff": 1.476881, "time_forward": 0.399059, "time_loss": 0.000232}
[03/28 11:05:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4270", "eta": "1:19:45", "loss": 0.075490, "lr": 0.027044, "mode": "train", "time_backward": 1.129081, "time_data": 0.016823, "time_diff": 1.547789, "time_forward": 0.398552, "time_loss": 0.000273}
[03/28 11:06:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4280", "eta": "1:19:20", "loss": 0.082976, "lr": 0.027061, "mode": "train", "time_backward": 1.129284, "time_data": 0.021565, "time_diff": 1.578417, "time_forward": 0.419359, "time_loss": 0.000275}
[03/28 11:06:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4290", "eta": "1:19:23", "loss": 0.078095, "lr": 0.027077, "mode": "train", "time_backward": 9.039023, "time_data": 0.017050, "time_diff": 9.498624, "time_forward": 0.400324, "time_loss": 0.000264}
[03/28 11:06:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4300", "eta": "1:18:57", "loss": 0.079670, "lr": 0.027093, "mode": "train", "time_backward": 1.058373, "time_data": 0.018065, "time_diff": 1.484591, "time_forward": 0.399868, "time_loss": 0.000275}
[03/28 11:07:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4310", "eta": "1:18:30", "loss": 0.073194, "lr": 0.027110, "mode": "train", "time_backward": 1.055392, "time_data": 0.016965, "time_diff": 1.482877, "time_forward": 0.400586, "time_loss": 0.000378}
[03/28 11:08:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4320", "eta": "1:18:04", "loss": 0.077541, "lr": 0.027126, "mode": "train", "time_backward": 1.081817, "time_data": 0.017013, "time_diff": 1.533878, "time_forward": 0.427071, "time_loss": 0.000409}
[03/28 11:08:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4330", "eta": "1:17:39", "loss": 0.074789, "lr": 0.027142, "mode": "train", "time_backward": 1.135488, "time_data": 0.034107, "time_diff": 1.607474, "time_forward": 0.426905, "time_loss": 0.000719}
[03/28 11:09:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4340", "eta": "1:17:13", "loss": 0.071649, "lr": 0.027159, "mode": "train", "time_backward": 1.059554, "time_data": 0.029692, "time_diff": 1.519690, "time_forward": 0.400804, "time_loss": 0.016729}
[03/28 11:09:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4350", "eta": "1:16:46", "loss": 0.070525, "lr": 0.027175, "mode": "train", "time_backward": 1.059692, "time_data": 0.017699, "time_diff": 1.484603, "time_forward": 0.398942, "time_loss": 0.000345}
[03/28 11:09:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4360", "eta": "1:16:19", "loss": 0.087484, "lr": 0.027191, "mode": "train", "time_backward": 1.061197, "time_data": 0.017964, "time_diff": 1.487445, "time_forward": 0.400172, "time_loss": 0.000235}
[03/28 11:10:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4370", "eta": "1:16:17", "loss": 0.079451, "lr": 0.027208, "mode": "train", "time_backward": 7.840013, "time_data": 0.017601, "time_diff": 8.316280, "time_forward": 0.432516, "time_loss": 0.000295}
[03/28 11:10:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4380", "eta": "1:15:50", "loss": 0.070048, "lr": 0.027224, "mode": "train", "time_backward": 1.071427, "time_data": 0.017164, "time_diff": 1.515317, "time_forward": 0.401522, "time_loss": 0.000535}
[03/28 11:10:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4390", "eta": "1:15:24", "loss": 0.078853, "lr": 0.027240, "mode": "train", "time_backward": 1.055887, "time_data": 0.018015, "time_diff": 1.484281, "time_forward": 0.402738, "time_loss": 0.000761}
[03/28 11:11:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4400", "eta": "1:14:58", "loss": 0.071474, "lr": 0.027257, "mode": "train", "time_backward": 1.143972, "time_data": 0.016910, "time_diff": 1.562241, "time_forward": 0.399902, "time_loss": 0.000428}
[03/28 11:11:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4410", "eta": "1:14:31", "loss": 0.070838, "lr": 0.027273, "mode": "train", "time_backward": 1.075134, "time_data": 0.021009, "time_diff": 1.500205, "time_forward": 0.401049, "time_loss": 0.000367}
[03/28 11:11:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4420", "eta": "1:14:05", "loss": 0.077890, "lr": 0.027289, "mode": "train", "time_backward": 1.070853, "time_data": 0.019014, "time_diff": 1.498810, "time_forward": 0.402419, "time_loss": 0.000294}
[03/28 11:12:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4430", "eta": "1:13:38", "loss": 0.070361, "lr": 0.027306, "mode": "train", "time_backward": 1.059648, "time_data": 0.019578, "time_diff": 1.528559, "time_forward": 0.445501, "time_loss": 0.000465}
[03/28 11:12:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4440", "eta": "1:13:12", "loss": 0.075391, "lr": 0.027322, "mode": "train", "time_backward": 1.103257, "time_data": 0.026151, "time_diff": 1.592371, "time_forward": 0.419440, "time_loss": 0.000259}
[03/28 11:12:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4450", "eta": "1:12:46", "loss": 0.076558, "lr": 0.027338, "mode": "train", "time_backward": 1.078946, "time_data": 0.021889, "time_diff": 1.620339, "time_forward": 0.515822, "time_loss": 0.000347}
[03/28 11:12:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4460", "eta": "1:12:20", "loss": 0.071013, "lr": 0.027355, "mode": "train", "time_backward": 1.120582, "time_data": 0.026888, "time_diff": 1.570800, "time_forward": 0.419675, "time_loss": 0.000292}
[03/28 11:13:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4470", "eta": "1:11:54", "loss": 0.077555, "lr": 0.027371, "mode": "train", "time_backward": 1.174195, "time_data": 0.022284, "time_diff": 1.638851, "time_forward": 0.431103, "time_loss": 0.000450}
[03/28 11:13:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4480", "eta": "1:11:28", "loss": 0.082016, "lr": 0.027387, "mode": "train", "time_backward": 1.129985, "time_data": 0.046149, "time_diff": 1.599025, "time_forward": 0.410587, "time_loss": 0.000309}
[03/28 11:13:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4490", "eta": "1:11:02", "loss": 0.079293, "lr": 0.027404, "mode": "train", "time_backward": 1.073083, "time_data": 0.017010, "time_diff": 1.497703, "time_forward": 0.404120, "time_loss": 0.000282}
[03/28 11:13:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4500", "eta": "1:10:35", "loss": 0.076597, "lr": 0.027420, "mode": "train", "time_backward": 1.070936, "time_data": 0.017799, "time_diff": 1.503487, "time_forward": 0.399237, "time_loss": 0.000320}
[03/28 11:14:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4510", "eta": "1:10:10", "loss": 0.087480, "lr": 0.027436, "mode": "train", "time_backward": 1.110654, "time_data": 0.038878, "time_diff": 1.749485, "time_forward": 0.589634, "time_loss": 0.000423}
[03/28 11:14:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4520", "eta": "1:09:44", "loss": 0.077215, "lr": 0.027453, "mode": "train", "time_backward": 1.077143, "time_data": 0.018095, "time_diff": 1.542154, "time_forward": 0.430359, "time_loss": 0.000292}
[03/28 11:14:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4530", "eta": "1:09:17", "loss": 0.070378, "lr": 0.027469, "mode": "train", "time_backward": 1.108338, "time_data": 0.019677, "time_diff": 1.544040, "time_forward": 0.407933, "time_loss": 0.001072}
[03/28 11:15:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4540", "eta": "1:08:51", "loss": 0.082005, "lr": 0.027485, "mode": "train", "time_backward": 1.087058, "time_data": 0.018291, "time_diff": 1.522109, "time_forward": 0.398523, "time_loss": 0.000312}
[03/28 11:15:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4550", "eta": "1:08:25", "loss": 0.081963, "lr": 0.027502, "mode": "train", "time_backward": 1.122899, "time_data": 0.018815, "time_diff": 1.550049, "time_forward": 0.398815, "time_loss": 0.000921}
[03/28 11:15:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4560", "eta": "1:07:58", "loss": 0.071511, "lr": 0.027518, "mode": "train", "time_backward": 1.117004, "time_data": 0.018014, "time_diff": 1.544238, "time_forward": 0.399893, "time_loss": 0.000364}
[03/28 11:15:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4570", "eta": "1:07:32", "loss": 0.076733, "lr": 0.027534, "mode": "train", "time_backward": 1.076806, "time_data": 0.018291, "time_diff": 1.575286, "time_forward": 0.476474, "time_loss": 0.000337}
[03/28 11:16:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4580", "eta": "1:07:06", "loss": 0.074080, "lr": 0.027551, "mode": "train", "time_backward": 1.062251, "time_data": 0.024463, "time_diff": 1.571296, "time_forward": 0.413792, "time_loss": 0.000398}
[03/28 11:16:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4590", "eta": "1:06:40", "loss": 0.073307, "lr": 0.027567, "mode": "train", "time_backward": 1.114348, "time_data": 0.020877, "time_diff": 1.562446, "time_forward": 0.408512, "time_loss": 0.000403}
[03/28 11:16:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4600", "eta": "1:06:14", "loss": 0.076079, "lr": 0.027583, "mode": "train", "time_backward": 1.086274, "time_data": 0.017962, "time_diff": 1.525850, "time_forward": 0.401298, "time_loss": 0.001304}
[03/28 11:16:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4610", "eta": "1:03:22", "loss": 0.079885, "lr": 0.027600, "mode": "train", "time_backward": 1.097178, "time_data": 0.025191, "time_diff": 1.546267, "time_forward": 0.420165, "time_loss": 0.000371}
[03/28 11:17:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4620", "eta": "1:02:57", "loss": 0.082184, "lr": 0.027616, "mode": "train", "time_backward": 1.183833, "time_data": 0.024322, "time_diff": 1.621239, "time_forward": 0.406834, "time_loss": 0.000355}
[03/28 11:17:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4630", "eta": "1:02:32", "loss": 0.079895, "lr": 0.027632, "mode": "train", "time_backward": 1.057267, "time_data": 0.017213, "time_diff": 1.509761, "time_forward": 0.431186, "time_loss": 0.000296}
[03/28 11:17:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4640", "eta": "1:02:06", "loss": 0.087694, "lr": 0.027648, "mode": "train", "time_backward": 1.076249, "time_data": 0.017371, "time_diff": 1.516137, "time_forward": 0.414160, "time_loss": 0.000309}
[03/28 11:18:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4650", "eta": "1:01:42", "loss": 0.087576, "lr": 0.027665, "mode": "train", "time_backward": 1.131447, "time_data": 0.030082, "time_diff": 1.981251, "time_forward": 0.800946, "time_loss": 0.000345}
[03/28 11:18:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4660", "eta": "1:01:17", "loss": 0.075083, "lr": 0.027681, "mode": "train", "time_backward": 1.067757, "time_data": 0.016856, "time_diff": 1.537264, "time_forward": 0.444111, "time_loss": 0.000224}
[03/28 11:18:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4670", "eta": "1:00:52", "loss": 0.084023, "lr": 0.027697, "mode": "train", "time_backward": 1.070338, "time_data": 0.016957, "time_diff": 1.566153, "time_forward": 0.470709, "time_loss": 0.000312}
[03/28 11:18:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4680", "eta": "1:00:27", "loss": 0.073456, "lr": 0.027714, "mode": "train", "time_backward": 1.147008, "time_data": 0.019921, "time_diff": 1.603566, "time_forward": 0.428409, "time_loss": 0.000314}
[03/28 11:19:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4690", "eta": "0:59:29", "loss": 0.075636, "lr": 0.027730, "mode": "train", "time_backward": 1.067089, "time_data": 0.017083, "time_diff": 1.507295, "time_forward": 0.400266, "time_loss": 0.000244}
[03/28 11:19:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4700", "eta": "0:59:03", "loss": 0.077923, "lr": 0.027746, "mode": "train", "time_backward": 1.062014, "time_data": 0.017506, "time_diff": 1.482969, "time_forward": 0.399811, "time_loss": 0.000401}
[03/28 11:19:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4710", "eta": "0:58:38", "loss": 0.071788, "lr": 0.027763, "mode": "train", "time_backward": 1.062556, "time_data": 0.018257, "time_diff": 1.487486, "time_forward": 0.399144, "time_loss": 0.000326}
[03/28 11:19:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4720", "eta": "0:58:13", "loss": 0.079828, "lr": 0.027779, "mode": "train", "time_backward": 1.066774, "time_data": 0.016921, "time_diff": 1.514936, "time_forward": 0.399867, "time_loss": 0.000599}
[03/28 11:20:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4730", "eta": "0:57:48", "loss": 0.074984, "lr": 0.027795, "mode": "train", "time_backward": 1.192404, "time_data": 0.019358, "time_diff": 1.620292, "time_forward": 0.399685, "time_loss": 0.000420}
[03/28 11:20:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4740", "eta": "0:57:23", "loss": 0.080708, "lr": 0.027812, "mode": "train", "time_backward": 1.067492, "time_data": 0.016741, "time_diff": 1.513778, "time_forward": 0.398834, "time_loss": 0.000253}
[03/28 11:20:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4750", "eta": "0:56:58", "loss": 0.079344, "lr": 0.027828, "mode": "train", "time_backward": 1.062361, "time_data": 0.017959, "time_diff": 1.546523, "time_forward": 0.462495, "time_loss": 0.000415}
[03/28 11:21:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4760", "eta": "0:56:33", "loss": 0.071495, "lr": 0.027844, "mode": "train", "time_backward": 1.092276, "time_data": 0.016961, "time_diff": 1.539278, "time_forward": 0.414002, "time_loss": 0.000253}
[03/28 11:21:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4770", "eta": "0:56:04", "loss": 0.069773, "lr": 0.027861, "mode": "train", "time_backward": 1.088764, "time_data": 0.018028, "time_diff": 1.510738, "time_forward": 0.400214, "time_loss": 0.000310}
[03/28 11:21:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4780", "eta": "0:55:39", "loss": 0.076699, "lr": 0.027877, "mode": "train", "time_backward": 1.087704, "time_data": 0.025252, "time_diff": 1.538137, "time_forward": 0.418295, "time_loss": 0.000384}
[03/28 11:21:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4790", "eta": "0:55:14", "loss": 0.081082, "lr": 0.027893, "mode": "train", "time_backward": 1.077976, "time_data": 0.027299, "time_diff": 1.589253, "time_forward": 0.439669, "time_loss": 0.001828}
[03/28 11:22:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4800", "eta": "0:54:49", "loss": 0.071848, "lr": 0.027910, "mode": "train", "time_backward": 1.053877, "time_data": 0.018801, "time_diff": 1.479858, "time_forward": 0.399080, "time_loss": 0.000237}
[03/28 11:22:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4810", "eta": "0:54:24", "loss": 0.077146, "lr": 0.027926, "mode": "train", "time_backward": 1.074250, "time_data": 0.019636, "time_diff": 1.515316, "time_forward": 0.409418, "time_loss": 0.000334}
[03/28 11:23:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4820", "eta": "0:53:59", "loss": 0.083660, "lr": 0.027942, "mode": "train", "time_backward": 1.123568, "time_data": 0.017323, "time_diff": 1.561820, "time_forward": 0.412829, "time_loss": 0.000410}
[03/28 11:23:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4830", "eta": "0:53:34", "loss": 0.076616, "lr": 0.027959, "mode": "train", "time_backward": 1.175869, "time_data": 0.017181, "time_diff": 1.592375, "time_forward": 0.398636, "time_loss": 0.000232}
[03/28 11:23:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4840", "eta": "0:53:09", "loss": 0.077348, "lr": 0.027975, "mode": "train", "time_backward": 1.179305, "time_data": 0.017652, "time_diff": 1.628260, "time_forward": 0.427621, "time_loss": 0.000367}
[03/28 11:23:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4850", "eta": "0:50:23", "loss": 0.071711, "lr": 0.027991, "mode": "train", "time_backward": 1.061703, "time_data": 0.017247, "time_diff": 1.578900, "time_forward": 0.480874, "time_loss": 0.000260}
[03/28 11:24:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4860", "eta": "0:49:59", "loss": 0.074770, "lr": 0.028008, "mode": "train", "time_backward": 1.059088, "time_data": 0.024539, "time_diff": 1.582398, "time_forward": 0.497170, "time_loss": 0.000326}
[03/28 11:24:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4870", "eta": "0:49:33", "loss": 0.085634, "lr": 0.028024, "mode": "train", "time_backward": 1.097631, "time_data": 0.017677, "time_diff": 1.588359, "time_forward": 0.421779, "time_loss": 0.000332}
[03/28 11:24:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4880", "eta": "0:49:09", "loss": 0.078585, "lr": 0.028040, "mode": "train", "time_backward": 1.156061, "time_data": 0.024280, "time_diff": 1.668688, "time_forward": 0.411679, "time_loss": 0.000467}
[03/28 11:25:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4890", "eta": "0:48:45", "loss": 0.080110, "lr": 0.028057, "mode": "train", "time_backward": 1.108532, "time_data": 0.017842, "time_diff": 1.529473, "time_forward": 0.400481, "time_loss": 0.000341}
[03/28 11:25:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4900", "eta": "0:48:21", "loss": 0.069266, "lr": 0.028073, "mode": "train", "time_backward": 1.053165, "time_data": 0.030783, "time_diff": 1.608985, "time_forward": 0.452193, "time_loss": 0.000316}
[03/28 11:25:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4910", "eta": "0:47:57", "loss": 0.075716, "lr": 0.028089, "mode": "train", "time_backward": 1.084009, "time_data": 0.017726, "time_diff": 1.511013, "time_forward": 0.408340, "time_loss": 0.000269}
[03/28 11:25:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4920", "eta": "0:47:34", "loss": 0.075764, "lr": 0.028106, "mode": "train", "time_backward": 1.230433, "time_data": 0.021902, "time_diff": 1.669153, "time_forward": 0.405361, "time_loss": 0.000342}
[03/28 11:26:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4930", "eta": "0:46:03", "loss": 0.080218, "lr": 0.028122, "mode": "train", "time_backward": 1.064476, "time_data": 0.156802, "time_diff": 1.831973, "time_forward": 0.401759, "time_loss": 0.163956}
[03/28 11:26:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4940", "eta": "0:45:40", "loss": 0.074252, "lr": 0.028138, "mode": "train", "time_backward": 1.063153, "time_data": 0.074378, "time_diff": 1.637444, "time_forward": 0.491625, "time_loss": 0.000417}
[03/28 11:26:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4950", "eta": "0:45:17", "loss": 0.070500, "lr": 0.028155, "mode": "train", "time_backward": 1.127839, "time_data": 0.018272, "time_diff": 1.578799, "time_forward": 0.400993, "time_loss": 0.000332}
[03/28 11:26:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4960", "eta": "0:44:53", "loss": 0.077168, "lr": 0.028171, "mode": "train", "time_backward": 1.055135, "time_data": 0.017912, "time_diff": 1.479876, "time_forward": 0.400633, "time_loss": 0.000320}
[03/28 11:27:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4970", "eta": "0:44:30", "loss": 0.074319, "lr": 0.028187, "mode": "train", "time_backward": 1.274305, "time_data": 0.020007, "time_diff": 1.698933, "time_forward": 0.398809, "time_loss": 0.000223}
[03/28 11:27:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4980", "eta": "0:44:00", "loss": 0.085118, "lr": 0.028204, "mode": "train", "time_backward": 1.075138, "time_data": 0.018879, "time_diff": 1.507390, "time_forward": 0.399003, "time_loss": 0.000241}
[03/28 11:27:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "4990", "eta": "0:43:37", "loss": 0.083988, "lr": 0.028220, "mode": "train", "time_backward": 1.076145, "time_data": 0.017114, "time_diff": 1.526662, "time_forward": 0.413921, "time_loss": 0.000231}
[03/28 11:28:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5000", "eta": "0:43:14", "loss": 0.079801, "lr": 0.028236, "mode": "train", "time_backward": 1.066959, "time_data": 0.018078, "time_diff": 1.491115, "time_forward": 0.402372, "time_loss": 0.000339}
[03/28 11:28:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5010", "eta": "0:42:51", "loss": 0.087861, "lr": 0.028253, "mode": "train", "time_backward": 1.056018, "time_data": 0.017887, "time_diff": 1.483549, "time_forward": 0.399294, "time_loss": 0.000303}
[03/28 11:28:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5020", "eta": "0:42:27", "loss": 0.081405, "lr": 0.028269, "mode": "train", "time_backward": 1.167746, "time_data": 0.017016, "time_diff": 1.590061, "time_forward": 0.399330, "time_loss": 0.000365}
[03/28 11:28:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5030", "eta": "0:42:04", "loss": 0.069699, "lr": 0.028285, "mode": "train", "time_backward": 1.071393, "time_data": 0.016839, "time_diff": 1.567148, "time_forward": 0.403670, "time_loss": 0.000405}
[03/28 11:29:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5040", "eta": "0:41:41", "loss": 0.074583, "lr": 0.028302, "mode": "train", "time_backward": 1.071745, "time_data": 0.025137, "time_diff": 1.557571, "time_forward": 0.449407, "time_loss": 0.000281}
[03/28 11:29:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5050", "eta": "0:41:17", "loss": 0.073007, "lr": 0.028318, "mode": "train", "time_backward": 1.088502, "time_data": 0.016963, "time_diff": 1.505446, "time_forward": 0.398347, "time_loss": 0.000233}
[03/28 11:29:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5060", "eta": "0:40:54", "loss": 0.069773, "lr": 0.028334, "mode": "train", "time_backward": 1.079188, "time_data": 0.017132, "time_diff": 1.504641, "time_forward": 0.400281, "time_loss": 0.000408}
[03/28 11:29:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5070", "eta": "0:40:31", "loss": 0.073738, "lr": 0.028350, "mode": "train", "time_backward": 1.084562, "time_data": 0.017512, "time_diff": 1.526836, "time_forward": 0.413534, "time_loss": 0.000246}
[03/28 11:30:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5080", "eta": "0:40:07", "loss": 0.081523, "lr": 0.028367, "mode": "train", "time_backward": 1.054224, "time_data": 0.016934, "time_diff": 1.486393, "time_forward": 0.411692, "time_loss": 0.000258}
[03/28 11:30:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5090", "eta": "0:39:44", "loss": 0.071727, "lr": 0.028383, "mode": "train", "time_backward": 1.064606, "time_data": 0.023915, "time_diff": 1.636482, "time_forward": 0.523014, "time_loss": 0.000318}
[03/28 11:30:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5100", "eta": "0:39:21", "loss": 0.076973, "lr": 0.028399, "mode": "train", "time_backward": 1.096953, "time_data": 0.017212, "time_diff": 1.522446, "time_forward": 0.398427, "time_loss": 0.000279}
[03/28 11:31:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5110", "eta": "0:38:56", "loss": 0.076475, "lr": 0.028416, "mode": "train", "time_backward": 1.068964, "time_data": 0.016819, "time_diff": 1.489095, "time_forward": 0.399698, "time_loss": 0.000264}
[03/28 11:31:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5120", "eta": "0:38:33", "loss": 0.073195, "lr": 0.028432, "mode": "train", "time_backward": 1.119311, "time_data": 0.017399, "time_diff": 1.602214, "time_forward": 0.458065, "time_loss": 0.000319}
[03/28 11:31:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5130", "eta": "0:38:15", "loss": 0.074700, "lr": 0.028448, "mode": "train", "time_backward": 1.968026, "time_data": 0.043798, "time_diff": 4.595417, "time_forward": 2.483742, "time_loss": 0.088668}
[03/28 11:32:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5140", "eta": "0:37:47", "loss": 0.080930, "lr": 0.028465, "mode": "train", "time_backward": 1.073517, "time_data": 0.018467, "time_diff": 1.497942, "time_forward": 0.398725, "time_loss": 0.000344}
[03/28 11:32:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5150", "eta": "0:37:23", "loss": 0.071569, "lr": 0.028481, "mode": "train", "time_backward": 1.100846, "time_data": 0.025888, "time_diff": 1.557904, "time_forward": 0.405571, "time_loss": 0.000935}
[03/28 11:32:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5160", "eta": "0:37:00", "loss": 0.075575, "lr": 0.028497, "mode": "train", "time_backward": 1.086242, "time_data": 0.022596, "time_diff": 1.521555, "time_forward": 0.406115, "time_loss": 0.000253}
[03/28 11:32:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5170", "eta": "0:36:36", "loss": 0.073145, "lr": 0.028514, "mode": "train", "time_backward": 1.082094, "time_data": 0.022145, "time_diff": 1.534595, "time_forward": 0.422170, "time_loss": 0.001005}
[03/28 11:33:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5180", "eta": "0:36:13", "loss": 0.079316, "lr": 0.028530, "mode": "train", "time_backward": 1.056813, "time_data": 0.017698, "time_diff": 1.478909, "time_forward": 0.400362, "time_loss": 0.000306}
[03/28 11:33:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5190", "eta": "0:35:50", "loss": 0.075816, "lr": 0.028546, "mode": "train", "time_backward": 1.230817, "time_data": 0.019750, "time_diff": 1.654064, "time_forward": 0.399833, "time_loss": 0.000241}
[03/28 11:33:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5200", "eta": "0:35:27", "loss": 0.075557, "lr": 0.028563, "mode": "train", "time_backward": 1.085243, "time_data": 0.038863, "time_diff": 1.680280, "time_forward": 0.524635, "time_loss": 0.000295}
[03/28 11:33:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5210", "eta": "0:35:04", "loss": 0.073784, "lr": 0.028579, "mode": "train", "time_backward": 1.131687, "time_data": 0.033814, "time_diff": 1.604830, "time_forward": 0.421008, "time_loss": 0.000372}
[03/28 11:34:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5220", "eta": "0:34:33", "loss": 0.077697, "lr": 0.028595, "mode": "train", "time_backward": 1.137867, "time_data": 0.028263, "time_diff": 1.633486, "time_forward": 0.427275, "time_loss": 0.000241}
[03/28 11:34:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5230", "eta": "0:34:10", "loss": 0.074588, "lr": 0.028612, "mode": "train", "time_backward": 1.058623, "time_data": 0.018481, "time_diff": 1.494094, "time_forward": 0.413182, "time_loss": 0.000366}
[03/28 11:34:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5240", "eta": "0:33:47", "loss": 0.079641, "lr": 0.028628, "mode": "train", "time_backward": 1.078659, "time_data": 0.020333, "time_diff": 1.531451, "time_forward": 0.402320, "time_loss": 0.000267}
[03/28 11:34:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5250", "eta": "0:33:18", "loss": 0.070553, "lr": 0.028644, "mode": "train", "time_backward": 1.061515, "time_data": 0.017056, "time_diff": 1.526690, "time_forward": 0.419661, "time_loss": 0.000306}
[03/28 11:35:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5260", "eta": "0:32:38", "loss": 0.075064, "lr": 0.028661, "mode": "train", "time_backward": 1.055557, "time_data": 0.025623, "time_diff": 1.579842, "time_forward": 0.494943, "time_loss": 0.000326}
[03/28 11:35:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5270", "eta": "0:32:15", "loss": 0.075605, "lr": 0.028677, "mode": "train", "time_backward": 1.135506, "time_data": 0.017839, "time_diff": 1.588588, "time_forward": 0.399792, "time_loss": 0.000345}
[03/28 11:35:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5280", "eta": "0:31:52", "loss": 0.069858, "lr": 0.028693, "mode": "train", "time_backward": 1.090653, "time_data": 0.024048, "time_diff": 1.636369, "time_forward": 0.514182, "time_loss": 0.000301}
[03/28 11:36:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5290", "eta": "0:31:29", "loss": 0.071931, "lr": 0.028710, "mode": "train", "time_backward": 1.115077, "time_data": 0.020462, "time_diff": 1.573284, "time_forward": 0.425944, "time_loss": 0.000430}
[03/28 11:36:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5300", "eta": "0:31:06", "loss": 0.079361, "lr": 0.028726, "mode": "train", "time_backward": 1.100244, "time_data": 0.042798, "time_diff": 1.545153, "time_forward": 0.399583, "time_loss": 0.000359}
[03/28 11:36:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5310", "eta": "0:30:43", "loss": 0.085854, "lr": 0.028742, "mode": "train", "time_backward": 1.098226, "time_data": 0.018372, "time_diff": 1.551725, "time_forward": 0.405894, "time_loss": 0.000280}
[03/28 11:36:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5320", "eta": "0:30:20", "loss": 0.075447, "lr": 0.028759, "mode": "train", "time_backward": 1.051846, "time_data": 0.017393, "time_diff": 1.472766, "time_forward": 0.399911, "time_loss": 0.000217}
[03/28 11:37:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5330", "eta": "0:29:57", "loss": 0.079219, "lr": 0.028775, "mode": "train", "time_backward": 1.064746, "time_data": 0.017334, "time_diff": 1.635148, "time_forward": 0.526066, "time_loss": 0.000433}
[03/28 11:37:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5340", "eta": "0:29:35", "loss": 0.071971, "lr": 0.028791, "mode": "train", "time_backward": 1.084370, "time_data": 0.098831, "time_diff": 1.624190, "time_forward": 0.437244, "time_loss": 0.000421}
[03/28 11:37:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5350", "eta": "0:29:11", "loss": 0.079628, "lr": 0.028808, "mode": "train", "time_backward": 1.056860, "time_data": 0.017356, "time_diff": 1.489115, "time_forward": 0.399844, "time_loss": 0.000274}
[03/28 11:38:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5360", "eta": "0:27:58", "loss": 0.071256, "lr": 0.028824, "mode": "train", "time_backward": 1.055834, "time_data": 0.017233, "time_diff": 1.480338, "time_forward": 0.399921, "time_loss": 0.000340}
[03/28 11:39:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5370", "eta": "0:27:36", "loss": 0.068959, "lr": 0.028840, "mode": "train", "time_backward": 1.057165, "time_data": 0.017034, "time_diff": 1.477962, "time_forward": 0.399342, "time_loss": 0.000268}
[03/28 11:39:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5380", "eta": "0:27:16", "loss": 0.074304, "lr": 0.028857, "mode": "train", "time_backward": 1.614282, "time_data": 0.640011, "time_diff": 3.097660, "time_forward": 0.443603, "time_loss": 0.000471}
[03/28 11:40:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5390", "eta": "0:26:53", "loss": 0.080086, "lr": 0.028873, "mode": "train", "time_backward": 1.086923, "time_data": 0.022528, "time_diff": 1.522411, "time_forward": 0.398778, "time_loss": 0.000248}
[03/28 11:40:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5400", "eta": "0:26:31", "loss": 0.073811, "lr": 0.028889, "mode": "train", "time_backward": 1.096599, "time_data": 0.027481, "time_diff": 1.542322, "time_forward": 0.410842, "time_loss": 0.000266}
[03/28 11:40:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5410", "eta": "0:26:09", "loss": 0.068123, "lr": 0.028906, "mode": "train", "time_backward": 1.118894, "time_data": 0.031555, "time_diff": 1.558627, "time_forward": 0.399202, "time_loss": 0.000251}
[03/28 11:40:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5420", "eta": "0:25:47", "loss": 0.075995, "lr": 0.028922, "mode": "train", "time_backward": 1.128764, "time_data": 0.021107, "time_diff": 1.796114, "time_forward": 0.629373, "time_loss": 0.000572}
[03/28 11:41:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5430", "eta": "0:25:24", "loss": 0.075927, "lr": 0.028938, "mode": "train", "time_backward": 1.095183, "time_data": 0.017627, "time_diff": 1.580521, "time_forward": 0.402167, "time_loss": 0.000798}
[03/28 11:41:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5440", "eta": "0:25:02", "loss": 0.071047, "lr": 0.028955, "mode": "train", "time_backward": 1.229834, "time_data": 0.016783, "time_diff": 1.646363, "time_forward": 0.399129, "time_loss": 0.000260}
[03/28 11:41:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5450", "eta": "0:24:40", "loss": 0.080264, "lr": 0.028971, "mode": "train", "time_backward": 1.080195, "time_data": 0.017298, "time_diff": 1.515504, "time_forward": 0.413738, "time_loss": 0.000402}
[03/28 11:41:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5460", "eta": "0:24:17", "loss": 0.074703, "lr": 0.028987, "mode": "train", "time_backward": 1.082077, "time_data": 0.063011, "time_diff": 1.558881, "time_forward": 0.410043, "time_loss": 0.000266}
[03/28 11:42:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5470", "eta": "0:23:55", "loss": 0.077713, "lr": 0.029004, "mode": "train", "time_backward": 1.096365, "time_data": 0.020028, "time_diff": 1.540336, "time_forward": 0.400887, "time_loss": 0.000260}
[03/28 11:42:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5480", "eta": "0:23:33", "loss": 0.072906, "lr": 0.029020, "mode": "train", "time_backward": 1.111948, "time_data": 0.016759, "time_diff": 1.549277, "time_forward": 0.413625, "time_loss": 0.000278}
[03/28 11:42:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5490", "eta": "0:23:10", "loss": 0.074543, "lr": 0.029036, "mode": "train", "time_backward": 1.135312, "time_data": 0.016995, "time_diff": 1.563453, "time_forward": 0.399165, "time_loss": 0.000270}
[03/28 11:42:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5500", "eta": "0:22:48", "loss": 0.074423, "lr": 0.029053, "mode": "train", "time_backward": 1.094434, "time_data": 0.033151, "time_diff": 1.538785, "time_forward": 0.407594, "time_loss": 0.000245}
[03/28 11:43:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5510", "eta": "0:22:26", "loss": 0.068918, "lr": 0.029069, "mode": "train", "time_backward": 1.121842, "time_data": 0.021730, "time_diff": 1.677985, "time_forward": 0.473135, "time_loss": 0.000245}
[03/28 11:43:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5520", "eta": "0:22:03", "loss": 0.074143, "lr": 0.029085, "mode": "train", "time_backward": 1.066327, "time_data": 0.017066, "time_diff": 1.553635, "time_forward": 0.466600, "time_loss": 0.000342}
[03/28 11:43:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5530", "eta": "0:21:41", "loss": 0.076780, "lr": 0.029101, "mode": "train", "time_backward": 1.091414, "time_data": 0.020137, "time_diff": 1.520921, "time_forward": 0.401692, "time_loss": 0.000240}
[03/28 11:44:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5540", "eta": "0:21:19", "loss": 0.072074, "lr": 0.029118, "mode": "train", "time_backward": 1.143130, "time_data": 0.024328, "time_diff": 1.624374, "time_forward": 0.453001, "time_loss": 0.000607}
[03/28 11:44:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5550", "eta": "0:20:56", "loss": 0.071331, "lr": 0.029134, "mode": "train", "time_backward": 1.136577, "time_data": 0.017468, "time_diff": 1.577386, "time_forward": 0.401048, "time_loss": 0.002282}
[03/28 11:44:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5560", "eta": "0:20:34", "loss": 0.071268, "lr": 0.029150, "mode": "train", "time_backward": 1.182631, "time_data": 0.022014, "time_diff": 1.607767, "time_forward": 0.402316, "time_loss": 0.000379}
[03/28 11:44:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5570", "eta": "0:20:12", "loss": 0.076659, "lr": 0.029167, "mode": "train", "time_backward": 1.084128, "time_data": 0.041155, "time_diff": 1.527402, "time_forward": 0.399345, "time_loss": 0.000270}
[03/28 11:45:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5580", "eta": "0:19:49", "loss": 0.069651, "lr": 0.029183, "mode": "train", "time_backward": 1.114536, "time_data": 0.016994, "time_diff": 1.577045, "time_forward": 0.444769, "time_loss": 0.000362}
[03/28 11:45:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5590", "eta": "0:19:16", "loss": 0.073960, "lr": 0.029199, "mode": "train", "time_backward": 1.056164, "time_data": 0.016895, "time_diff": 1.477638, "time_forward": 0.401151, "time_loss": 0.000297}
[03/28 11:45:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5600", "eta": "0:18:53", "loss": 0.073677, "lr": 0.029216, "mode": "train", "time_backward": 1.121698, "time_data": 0.017293, "time_diff": 1.542088, "time_forward": 0.399710, "time_loss": 0.000403}
[03/28 11:45:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5610", "eta": "0:18:31", "loss": 0.074845, "lr": 0.029232, "mode": "train", "time_backward": 1.056890, "time_data": 0.017757, "time_diff": 1.487593, "time_forward": 0.400184, "time_loss": 0.000361}
[03/28 11:46:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5620", "eta": "0:17:37", "loss": 0.078039, "lr": 0.029248, "mode": "train", "time_backward": 1.057435, "time_data": 0.016988, "time_diff": 1.477516, "time_forward": 0.399428, "time_loss": 0.000420}
[03/28 11:46:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5630", "eta": "0:17:15", "loss": 0.068170, "lr": 0.029265, "mode": "train", "time_backward": 1.069742, "time_data": 0.017360, "time_diff": 1.492309, "time_forward": 0.398332, "time_loss": 0.000240}
[03/28 11:46:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5640", "eta": "0:16:54", "loss": 0.077959, "lr": 0.029281, "mode": "train", "time_backward": 1.109756, "time_data": 0.016829, "time_diff": 1.541857, "time_forward": 0.399976, "time_loss": 0.000326}
[03/28 11:47:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5650", "eta": "0:16:32", "loss": 0.081895, "lr": 0.029297, "mode": "train", "time_backward": 1.076642, "time_data": 0.017136, "time_diff": 1.552013, "time_forward": 0.397871, "time_loss": 0.000236}
[03/28 11:47:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5660", "eta": "0:16:11", "loss": 0.071735, "lr": 0.029314, "mode": "train", "time_backward": 1.069851, "time_data": 0.054301, "time_diff": 1.551331, "time_forward": 0.399052, "time_loss": 0.000293}
[03/28 11:47:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5670", "eta": "0:15:49", "loss": 0.077007, "lr": 0.029330, "mode": "train", "time_backward": 1.074837, "time_data": 0.020297, "time_diff": 1.504375, "time_forward": 0.402708, "time_loss": 0.001157}
[03/28 11:48:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5680", "eta": "0:15:28", "loss": 0.071091, "lr": 0.029346, "mode": "train", "time_backward": 1.128624, "time_data": 0.016924, "time_diff": 1.580637, "time_forward": 0.419784, "time_loss": 0.000256}
[03/28 11:48:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5690", "eta": "0:15:06", "loss": 0.072654, "lr": 0.029363, "mode": "train", "time_backward": 1.062682, "time_data": 0.017288, "time_diff": 1.489987, "time_forward": 0.399868, "time_loss": 0.000391}
[03/28 11:48:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5700", "eta": "0:14:37", "loss": 0.074891, "lr": 0.029379, "mode": "train", "time_backward": 1.057652, "time_data": 0.017723, "time_diff": 1.486614, "time_forward": 0.403824, "time_loss": 0.000645}
[03/28 11:49:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5710", "eta": "0:14:16", "loss": 0.075034, "lr": 0.029395, "mode": "train", "time_backward": 1.109409, "time_data": 0.017956, "time_diff": 1.637795, "time_forward": 0.467038, "time_loss": 0.000291}
[03/28 11:49:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5720", "eta": "0:13:55", "loss": 0.075202, "lr": 0.029412, "mode": "train", "time_backward": 1.062601, "time_data": 0.020344, "time_diff": 1.495247, "time_forward": 0.408782, "time_loss": 0.000233}
[03/28 11:50:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5730", "eta": "0:13:33", "loss": 0.072892, "lr": 0.029428, "mode": "train", "time_backward": 1.091974, "time_data": 0.019052, "time_diff": 1.571586, "time_forward": 0.456811, "time_loss": 0.000395}
[03/28 11:50:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5740", "eta": "0:12:58", "loss": 0.076899, "lr": 0.029444, "mode": "train", "time_backward": 1.056567, "time_data": 0.017729, "time_diff": 1.479759, "time_forward": 0.401750, "time_loss": 0.000400}
[03/28 11:51:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5750", "eta": "0:12:37", "loss": 0.079044, "lr": 0.029461, "mode": "train", "time_backward": 1.060173, "time_data": 0.017670, "time_diff": 1.483687, "time_forward": 0.398995, "time_loss": 0.000323}
[03/28 11:51:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5760", "eta": "0:12:18", "loss": 0.075033, "lr": 0.029477, "mode": "train", "time_backward": 4.128369, "time_data": 0.017226, "time_diff": 4.553212, "time_forward": 0.405894, "time_loss": 0.001125}
[03/28 11:51:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5770", "eta": "0:11:57", "loss": 0.073528, "lr": 0.029493, "mode": "train", "time_backward": 1.129247, "time_data": 0.018565, "time_diff": 1.569598, "time_forward": 0.421131, "time_loss": 0.000245}
[03/28 11:52:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5780", "eta": "0:11:36", "loss": 0.077593, "lr": 0.029510, "mode": "train", "time_backward": 1.056235, "time_data": 0.017661, "time_diff": 1.480760, "time_forward": 0.400142, "time_loss": 0.000397}
[03/28 11:52:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5790", "eta": "0:11:15", "loss": 0.077200, "lr": 0.029526, "mode": "train", "time_backward": 1.055743, "time_data": 0.017152, "time_diff": 1.479994, "time_forward": 0.399522, "time_loss": 0.000285}
[03/28 11:52:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5800", "eta": "0:10:54", "loss": 0.074869, "lr": 0.029542, "mode": "train", "time_backward": 1.055746, "time_data": 0.017221, "time_diff": 1.544193, "time_forward": 0.438153, "time_loss": 0.000253}
[03/28 11:53:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5810", "eta": "0:10:33", "loss": 0.069917, "lr": 0.029559, "mode": "train", "time_backward": 1.063205, "time_data": 0.016862, "time_diff": 1.489310, "time_forward": 0.401900, "time_loss": 0.000335}
[03/28 11:53:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5820", "eta": "0:10:10", "loss": 0.076193, "lr": 0.029575, "mode": "train", "time_backward": 1.057239, "time_data": 0.017413, "time_diff": 1.519253, "time_forward": 0.398949, "time_loss": 0.000259}
[03/28 11:54:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5830", "eta": "0:09:49", "loss": 0.073203, "lr": 0.029591, "mode": "train", "time_backward": 1.057930, "time_data": 0.017386, "time_diff": 1.481418, "time_forward": 0.402575, "time_loss": 0.000472}
[03/28 11:54:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5840", "eta": "0:09:28", "loss": 0.071721, "lr": 0.029608, "mode": "train", "time_backward": 1.056421, "time_data": 0.016989, "time_diff": 1.486913, "time_forward": 0.400546, "time_loss": 0.000239}
[03/28 11:54:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5850", "eta": "0:09:07", "loss": 0.077126, "lr": 0.029624, "mode": "train", "time_backward": 1.106552, "time_data": 0.025026, "time_diff": 1.563393, "time_forward": 0.412509, "time_loss": 0.000242}
[03/28 11:55:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5860", "eta": "0:08:46", "loss": 0.079912, "lr": 0.029640, "mode": "train", "time_backward": 1.305270, "time_data": 0.016961, "time_diff": 1.738417, "time_forward": 0.399789, "time_loss": 0.000280}
[03/28 11:55:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5870", "eta": "0:08:25", "loss": 0.078101, "lr": 0.029657, "mode": "train", "time_backward": 1.061792, "time_data": 0.018410, "time_diff": 1.483955, "time_forward": 0.399931, "time_loss": 0.000517}
[03/28 11:56:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5880", "eta": "0:08:13", "loss": 0.071973, "lr": 0.029673, "mode": "train", "time_backward": 19.276230, "time_data": 0.016768, "time_diff": 19.697605, "time_forward": 0.401057, "time_loss": 0.000643}
[03/28 11:56:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5890", "eta": "0:07:51", "loss": 0.078792, "lr": 0.029689, "mode": "train", "time_backward": 1.083979, "time_data": 0.016986, "time_diff": 1.509032, "time_forward": 0.399397, "time_loss": 0.000295}
[03/28 11:56:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5900", "eta": "0:07:30", "loss": 0.071729, "lr": 0.029706, "mode": "train", "time_backward": 1.090031, "time_data": 0.017927, "time_diff": 1.566221, "time_forward": 0.454411, "time_loss": 0.000485}
[03/28 11:56:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5910", "eta": "0:07:09", "loss": 0.068967, "lr": 0.029722, "mode": "train", "time_backward": 1.058799, "time_data": 0.018284, "time_diff": 1.482607, "time_forward": 0.399454, "time_loss": 0.000400}
[03/28 11:57:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5920", "eta": "0:06:43", "loss": 0.073457, "lr": 0.029738, "mode": "train", "time_backward": 1.055890, "time_data": 0.019029, "time_diff": 1.533555, "time_forward": 0.455129, "time_loss": 0.000265}
[03/28 11:57:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5930", "eta": "0:06:22", "loss": 0.066752, "lr": 0.029755, "mode": "train", "time_backward": 1.055145, "time_data": 0.017037, "time_diff": 1.478317, "time_forward": 0.398804, "time_loss": 0.000259}
[03/28 11:58:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5940", "eta": "0:06:01", "loss": 0.067371, "lr": 0.029771, "mode": "train", "time_backward": 1.058064, "time_data": 0.020273, "time_diff": 1.491829, "time_forward": 0.399039, "time_loss": 0.000278}
[03/28 11:58:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5950", "eta": "0:05:40", "loss": 0.074011, "lr": 0.029787, "mode": "train", "time_backward": 1.072029, "time_data": 0.018688, "time_diff": 1.497956, "time_forward": 0.398861, "time_loss": 0.000292}
[03/28 11:58:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5960", "eta": "0:05:19", "loss": 0.072181, "lr": 0.029803, "mode": "train", "time_backward": 1.134509, "time_data": 0.017217, "time_diff": 1.561849, "time_forward": 0.398881, "time_loss": 0.000265}
[03/28 11:59:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5970", "eta": "0:04:57", "loss": 0.076393, "lr": 0.029820, "mode": "train", "time_backward": 1.055305, "time_data": 0.016773, "time_diff": 1.516502, "time_forward": 0.398063, "time_loss": 0.000289}
[03/28 11:59:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5980", "eta": "0:04:36", "loss": 0.078702, "lr": 0.029836, "mode": "train", "time_backward": 1.064098, "time_data": 0.019100, "time_diff": 1.491931, "time_forward": 0.400338, "time_loss": 0.000366}
[03/28 12:00:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "5990", "eta": "0:04:15", "loss": 0.073431, "lr": 0.029852, "mode": "train", "time_backward": 1.083910, "time_data": 0.023089, "time_diff": 1.516979, "time_forward": 0.401812, "time_loss": 0.000400}
[03/28 12:00:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6000", "eta": "0:03:54", "loss": 0.068127, "lr": 0.029869, "mode": "train", "time_backward": 1.077891, "time_data": 0.018902, "time_diff": 1.500315, "time_forward": 0.399908, "time_loss": 0.000276}
[03/28 12:00:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6010", "eta": "0:03:32", "loss": 0.072954, "lr": 0.029885, "mode": "train", "time_backward": 1.119362, "time_data": 0.018558, "time_diff": 1.542681, "time_forward": 0.401653, "time_loss": 0.000331}
[03/28 12:01:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6020", "eta": "0:03:11", "loss": 0.070190, "lr": 0.029901, "mode": "train", "time_backward": 1.056912, "time_data": 0.016931, "time_diff": 1.479764, "time_forward": 0.399503, "time_loss": 0.000244}
[03/28 12:01:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6030", "eta": "0:02:50", "loss": 0.072863, "lr": 0.029918, "mode": "train", "time_backward": 1.058918, "time_data": 0.017202, "time_diff": 1.483986, "time_forward": 0.399844, "time_loss": 0.000268}
[03/28 12:01:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6040", "eta": "0:02:29", "loss": 0.079521, "lr": 0.029934, "mode": "train", "time_backward": 1.080750, "time_data": 0.016986, "time_diff": 2.004361, "time_forward": 0.895111, "time_loss": 0.000292}
[03/28 12:02:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6050", "eta": "0:02:08", "loss": 0.069823, "lr": 0.029950, "mode": "train", "time_backward": 1.066600, "time_data": 0.017595, "time_diff": 1.493466, "time_forward": 0.406349, "time_loss": 0.000447}
[03/28 12:02:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6060", "eta": "0:01:47", "loss": 0.068236, "lr": 0.029967, "mode": "train", "time_backward": 1.064879, "time_data": 0.021909, "time_diff": 1.514375, "time_forward": 0.404936, "time_loss": 0.000365}
[03/28 12:02:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6070", "eta": "0:01:26", "loss": 0.070245, "lr": 0.029983, "mode": "train", "time_backward": 1.057826, "time_data": 0.017536, "time_diff": 1.486886, "time_forward": 0.398841, "time_loss": 0.000285}
[03/28 12:03:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6080", "eta": "0:01:05", "loss": 0.064000, "lr": 0.029999, "mode": "train", "time_backward": 1.109700, "time_data": 0.018440, "time_diff": 1.553576, "time_forward": 0.417527, "time_loss": 0.000273}
[03/28 12:03:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6090", "eta": "0:00:44", "loss": 0.076326, "lr": 0.030016, "mode": "train", "time_backward": 1.057914, "time_data": 0.017042, "time_diff": 1.483657, "time_forward": 0.400635, "time_loss": 0.000454}
[03/28 12:03:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6100", "eta": "0:00:23", "loss": 0.072671, "lr": 0.030032, "mode": "train", "time_backward": 1.053452, "time_data": 0.017483, "time_diff": 1.476281, "time_forward": 0.398324, "time_loss": 0.000379}
[03/28 12:04:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "3", "cur_iter": "6110", "eta": "0:00:02", "loss": 0.075039, "lr": 0.030048, "mode": "train", "time_backward": 1.053422, "time_data": 0.016379, "time_diff": 1.475205, "time_forward": 0.396655, "time_loss": 0.000212}
[03/28 12:42:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "10", "eta": "3:34:31", "loss": 0.072757, "lr": 0.030065, "mode": "train", "time_backward": 1.140390, "time_data": 0.016991, "time_diff": 1.581076, "time_forward": 0.402651, "time_loss": 0.000414}
[03/28 12:43:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "20", "eta": "3:34:10", "loss": 0.073901, "lr": 0.030081, "mode": "train", "time_backward": 1.059046, "time_data": 0.017246, "time_diff": 1.516690, "time_forward": 0.400725, "time_loss": 0.000349}
[03/28 12:43:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "30", "eta": "3:33:50", "loss": 0.071071, "lr": 0.030097, "mode": "train", "time_backward": 1.104616, "time_data": 0.016871, "time_diff": 1.530268, "time_forward": 0.399166, "time_loss": 0.000285}
[03/28 12:43:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "40", "eta": "3:33:29", "loss": 0.081263, "lr": 0.030114, "mode": "train", "time_backward": 1.054863, "time_data": 0.018238, "time_diff": 1.515070, "time_forward": 0.435870, "time_loss": 0.000243}
[03/28 12:44:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "50", "eta": "3:29:50", "loss": 0.069328, "lr": 0.030130, "mode": "train", "time_backward": 1.129733, "time_data": 0.019220, "time_diff": 1.551869, "time_forward": 0.402251, "time_loss": 0.000369}
[03/28 12:44:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "60", "eta": "3:29:30", "loss": 0.075832, "lr": 0.030146, "mode": "train", "time_backward": 1.093544, "time_data": 0.019952, "time_diff": 1.520632, "time_forward": 0.400386, "time_loss": 0.000230}
[03/28 12:44:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "70", "eta": "3:29:08", "loss": 0.069663, "lr": 0.030163, "mode": "train", "time_backward": 1.055202, "time_data": 0.017002, "time_diff": 1.479960, "time_forward": 0.398697, "time_loss": 0.000295}
[03/28 12:45:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "80", "eta": "3:28:50", "loss": 0.076988, "lr": 0.030179, "mode": "train", "time_backward": 1.136357, "time_data": 0.104817, "time_diff": 1.715031, "time_forward": 0.409309, "time_loss": 0.000331}
[03/28 12:45:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "90", "eta": "3:25:13", "loss": 0.077239, "lr": 0.030195, "mode": "train", "time_backward": 1.093471, "time_data": 0.021781, "time_diff": 1.521033, "time_forward": 0.398065, "time_loss": 0.000253}
[03/28 12:45:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "100", "eta": "3:25:19", "loss": 0.075385, "lr": 0.030212, "mode": "train", "time_backward": 3.275213, "time_data": 0.017112, "time_diff": 3.762252, "time_forward": 0.403882, "time_loss": 0.000276}
[03/28 12:45:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "110", "eta": "3:24:59", "loss": 0.070469, "lr": 0.030228, "mode": "train", "time_backward": 1.055601, "time_data": 0.017764, "time_diff": 1.528556, "time_forward": 0.451691, "time_loss": 0.000362}
[03/28 12:46:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "120", "eta": "3:24:39", "loss": 0.073706, "lr": 0.030244, "mode": "train", "time_backward": 1.149601, "time_data": 0.021873, "time_diff": 1.575939, "time_forward": 0.400876, "time_loss": 0.000243}
[03/28 12:46:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "130", "eta": "3:20:58", "loss": 0.071674, "lr": 0.030261, "mode": "train", "time_backward": 1.062983, "time_data": 0.020326, "time_diff": 1.534715, "time_forward": 0.447357, "time_loss": 0.000727}
[03/28 12:46:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "140", "eta": "3:20:38", "loss": 0.073202, "lr": 0.030277, "mode": "train", "time_backward": 1.053375, "time_data": 0.021378, "time_diff": 1.535552, "time_forward": 0.443685, "time_loss": 0.000337}
[03/28 12:47:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "150", "eta": "3:20:18", "loss": 0.078761, "lr": 0.030293, "mode": "train", "time_backward": 1.079791, "time_data": 0.016691, "time_diff": 1.537874, "time_forward": 0.398757, "time_loss": 0.000374}
[03/28 12:47:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "160", "eta": "3:19:57", "loss": 0.074188, "lr": 0.030310, "mode": "train", "time_backward": 1.058202, "time_data": 0.018822, "time_diff": 1.535579, "time_forward": 0.455697, "time_loss": 0.000253}
[03/28 12:47:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "170", "eta": "3:19:37", "loss": 0.073861, "lr": 0.030326, "mode": "train", "time_backward": 1.082316, "time_data": 0.023266, "time_diff": 1.508557, "time_forward": 0.402054, "time_loss": 0.000334}
[03/28 12:48:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "180", "eta": "3:19:14", "loss": 0.071057, "lr": 0.030342, "mode": "train", "time_backward": 1.100047, "time_data": 0.019542, "time_diff": 1.615589, "time_forward": 0.494824, "time_loss": 0.000358}
[03/28 12:48:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "190", "eta": "3:18:53", "loss": 0.070016, "lr": 0.030359, "mode": "train", "time_backward": 1.082896, "time_data": 0.017991, "time_diff": 1.513398, "time_forward": 0.408641, "time_loss": 0.000921}
[03/28 12:48:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "200", "eta": "3:18:32", "loss": 0.068102, "lr": 0.030375, "mode": "train", "time_backward": 1.062059, "time_data": 0.017418, "time_diff": 1.489775, "time_forward": 0.401420, "time_loss": 0.000277}
[03/28 12:48:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "210", "eta": "3:18:13", "loss": 0.069539, "lr": 0.030391, "mode": "train", "time_backward": 1.118167, "time_data": 0.019376, "time_diff": 1.542907, "time_forward": 0.404770, "time_loss": 0.000292}
[03/28 12:49:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "220", "eta": "3:17:52", "loss": 0.075607, "lr": 0.030408, "mode": "train", "time_backward": 1.054258, "time_data": 0.017118, "time_diff": 1.480221, "time_forward": 0.399976, "time_loss": 0.000440}
[03/28 12:49:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "230", "eta": "3:15:02", "loss": 0.075512, "lr": 0.030424, "mode": "train", "time_backward": 1.101603, "time_data": 0.017205, "time_diff": 1.585143, "time_forward": 0.400073, "time_loss": 0.000232}
[03/28 12:49:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "240", "eta": "3:14:42", "loss": 0.068357, "lr": 0.030440, "mode": "train", "time_backward": 1.095531, "time_data": 0.017585, "time_diff": 1.511908, "time_forward": 0.398306, "time_loss": 0.000241}
[03/28 12:50:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "250", "eta": "3:14:21", "loss": 0.070505, "lr": 0.030457, "mode": "train", "time_backward": 1.068983, "time_data": 0.017077, "time_diff": 1.499494, "time_forward": 0.398490, "time_loss": 0.000335}
[03/28 12:50:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "260", "eta": "3:17:31", "loss": 0.074988, "lr": 0.030473, "mode": "train", "time_backward": 18.960287, "time_data": 0.017705, "time_diff": 19.398154, "time_forward": 0.400731, "time_loss": 0.000660}
[03/28 12:51:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "270", "eta": "3:17:11", "loss": 0.071962, "lr": 0.030489, "mode": "train", "time_backward": 1.056693, "time_data": 0.017112, "time_diff": 1.508528, "time_forward": 0.430916, "time_loss": 0.000529}
[03/28 12:51:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "280", "eta": "3:16:52", "loss": 0.079356, "lr": 0.030505, "mode": "train", "time_backward": 1.057504, "time_data": 0.017467, "time_diff": 1.599858, "time_forward": 0.506370, "time_loss": 0.000315}
[03/28 12:51:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "290", "eta": "3:16:32", "loss": 0.075524, "lr": 0.030522, "mode": "train", "time_backward": 1.098274, "time_data": 0.018140, "time_diff": 1.574662, "time_forward": 0.401285, "time_loss": 0.000788}
[03/28 12:52:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "300", "eta": "3:16:13", "loss": 0.077612, "lr": 0.030538, "mode": "train", "time_backward": 1.126662, "time_data": 0.016967, "time_diff": 1.572803, "time_forward": 0.423085, "time_loss": 0.000407}
[03/28 12:52:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "310", "eta": "3:15:53", "loss": 0.077348, "lr": 0.030554, "mode": "train", "time_backward": 1.058422, "time_data": 0.017178, "time_diff": 1.482143, "time_forward": 0.398882, "time_loss": 0.000273}
[03/28 12:53:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "320", "eta": "3:15:32", "loss": 0.077356, "lr": 0.030571, "mode": "train", "time_backward": 1.060404, "time_data": 0.016864, "time_diff": 1.483477, "time_forward": 0.400765, "time_loss": 0.000312}
[03/28 12:53:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "330", "eta": "3:15:12", "loss": 0.068703, "lr": 0.030587, "mode": "train", "time_backward": 1.116609, "time_data": 0.017215, "time_diff": 1.535094, "time_forward": 0.399020, "time_loss": 0.000291}
[03/28 12:53:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "340", "eta": "3:15:37", "loss": 0.063387, "lr": 0.030603, "mode": "train", "time_backward": 5.022020, "time_data": 0.019548, "time_diff": 5.443523, "time_forward": 0.398575, "time_loss": 0.000273}
[03/28 12:53:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "350", "eta": "3:15:31", "loss": 0.076276, "lr": 0.030620, "mode": "train", "time_backward": 1.182482, "time_data": 0.018503, "time_diff": 2.670371, "time_forward": 1.366646, "time_loss": 0.018914}
[03/28 12:54:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "360", "eta": "3:15:11", "loss": 0.063585, "lr": 0.030636, "mode": "train", "time_backward": 1.082974, "time_data": 0.030716, "time_diff": 1.528470, "time_forward": 0.398778, "time_loss": 0.000367}
[03/28 12:54:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "370", "eta": "3:14:51", "loss": 0.078201, "lr": 0.030652, "mode": "train", "time_backward": 1.150714, "time_data": 0.017128, "time_diff": 1.570009, "time_forward": 0.398558, "time_loss": 0.000246}
[03/28 12:54:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "380", "eta": "3:12:47", "loss": 0.074039, "lr": 0.030669, "mode": "train", "time_backward": 1.298037, "time_data": 0.071603, "time_diff": 3.187294, "time_forward": 1.742403, "time_loss": 0.071906}
[03/28 12:55:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "390", "eta": "3:12:27", "loss": 0.077079, "lr": 0.030685, "mode": "train", "time_backward": 1.056601, "time_data": 0.017533, "time_diff": 1.481184, "time_forward": 0.403578, "time_loss": 0.000310}
[03/28 12:55:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "400", "eta": "3:12:06", "loss": 0.076640, "lr": 0.030701, "mode": "train", "time_backward": 1.112129, "time_data": 0.019101, "time_diff": 1.538755, "time_forward": 0.399239, "time_loss": 0.000222}
[03/28 12:55:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "410", "eta": "3:11:46", "loss": 0.075304, "lr": 0.030718, "mode": "train", "time_backward": 1.060089, "time_data": 0.019530, "time_diff": 1.537954, "time_forward": 0.431913, "time_loss": 0.000301}
[03/28 12:56:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "420", "eta": "3:11:26", "loss": 0.070288, "lr": 0.030734, "mode": "train", "time_backward": 1.125204, "time_data": 0.017984, "time_diff": 1.568000, "time_forward": 0.415514, "time_loss": 0.001819}
[03/28 12:56:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "430", "eta": "3:11:00", "loss": 0.069185, "lr": 0.030750, "mode": "train", "time_backward": 1.080576, "time_data": 0.018971, "time_diff": 1.503967, "time_forward": 0.401534, "time_loss": 0.000489}
[03/28 12:56:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "440", "eta": "3:10:33", "loss": 0.079289, "lr": 0.030767, "mode": "train", "time_backward": 1.056629, "time_data": 0.016762, "time_diff": 1.481197, "time_forward": 0.397710, "time_loss": 0.000241}
[03/28 12:57:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "450", "eta": "3:10:13", "loss": 0.069374, "lr": 0.030783, "mode": "train", "time_backward": 1.125955, "time_data": 0.027578, "time_diff": 1.581190, "time_forward": 0.407671, "time_loss": 0.000299}
[03/28 12:57:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "460", "eta": "3:05:06", "loss": 0.075373, "lr": 0.030799, "mode": "train", "time_backward": 1.078401, "time_data": 0.017267, "time_diff": 1.549245, "time_forward": 0.406596, "time_loss": 0.003453}
[03/28 12:57:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "470", "eta": "3:04:47", "loss": 0.066855, "lr": 0.030816, "mode": "train", "time_backward": 1.111385, "time_data": 0.017448, "time_diff": 1.558261, "time_forward": 0.423012, "time_loss": 0.000265}
[03/28 12:57:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "480", "eta": "2:51:21", "loss": 0.073954, "lr": 0.030832, "mode": "train", "time_backward": 1.137319, "time_data": 0.018100, "time_diff": 1.642052, "time_forward": 0.408953, "time_loss": 0.000337}
[03/28 12:58:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "490", "eta": "2:51:04", "loss": 0.070033, "lr": 0.030848, "mode": "train", "time_backward": 1.057262, "time_data": 0.016775, "time_diff": 1.569416, "time_forward": 0.398118, "time_loss": 0.000342}
[03/28 12:58:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "500", "eta": "2:51:02", "loss": 0.069469, "lr": 0.030865, "mode": "train", "time_backward": 2.390025, "time_data": 0.018611, "time_diff": 2.930258, "time_forward": 0.517979, "time_loss": 0.000409}
[03/28 12:58:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "510", "eta": "2:50:44", "loss": 0.072666, "lr": 0.030881, "mode": "train", "time_backward": 1.107121, "time_data": 0.018349, "time_diff": 1.578717, "time_forward": 0.424983, "time_loss": 0.000551}
[03/28 12:59:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "520", "eta": "2:50:21", "loss": 0.072023, "lr": 0.030897, "mode": "train", "time_backward": 1.062670, "time_data": 0.017554, "time_diff": 1.484761, "time_forward": 0.401003, "time_loss": 0.000461}
[03/28 12:59:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "530", "eta": "2:50:03", "loss": 0.076429, "lr": 0.030914, "mode": "train", "time_backward": 1.088365, "time_data": 0.030537, "time_diff": 1.526039, "time_forward": 0.399512, "time_loss": 0.000361}
[03/28 12:59:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "540", "eta": "2:45:53", "loss": 0.065757, "lr": 0.030930, "mode": "train", "time_backward": 1.056473, "time_data": 0.016828, "time_diff": 1.495569, "time_forward": 0.398001, "time_loss": 0.000255}
[03/28 13:00:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "550", "eta": "2:45:35", "loss": 0.070102, "lr": 0.030946, "mode": "train", "time_backward": 1.055399, "time_data": 0.018594, "time_diff": 1.521793, "time_forward": 0.401492, "time_loss": 0.000267}
[03/28 13:00:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "560", "eta": "2:45:09", "loss": 0.068422, "lr": 0.030963, "mode": "train", "time_backward": 1.082050, "time_data": 0.024301, "time_diff": 1.524013, "time_forward": 0.412941, "time_loss": 0.001575}
[03/28 13:00:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "570", "eta": "2:44:52", "loss": 0.073680, "lr": 0.030979, "mode": "train", "time_backward": 1.095894, "time_data": 0.018634, "time_diff": 1.523862, "time_forward": 0.402383, "time_loss": 0.000279}
[03/28 13:01:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "580", "eta": "2:44:34", "loss": 0.074387, "lr": 0.030995, "mode": "train", "time_backward": 1.059582, "time_data": 0.017525, "time_diff": 1.485783, "time_forward": 0.399092, "time_loss": 0.000315}
[03/28 13:01:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "590", "eta": "2:44:17", "loss": 0.066061, "lr": 0.031012, "mode": "train", "time_backward": 1.141735, "time_data": 0.019983, "time_diff": 1.585328, "time_forward": 0.420136, "time_loss": 0.000350}
[03/28 13:01:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "600", "eta": "2:43:59", "loss": 0.070802, "lr": 0.031028, "mode": "train", "time_backward": 1.077942, "time_data": 0.016914, "time_diff": 1.501872, "time_forward": 0.399125, "time_loss": 0.000410}
[03/28 13:01:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "610", "eta": "2:43:41", "loss": 0.075403, "lr": 0.031044, "mode": "train", "time_backward": 1.101303, "time_data": 0.019316, "time_diff": 1.549757, "time_forward": 0.406843, "time_loss": 0.018130}
[03/28 13:02:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "620", "eta": "2:43:26", "loss": 0.075809, "lr": 0.031061, "mode": "train", "time_backward": 1.224113, "time_data": 0.017031, "time_diff": 1.706136, "time_forward": 0.460892, "time_loss": 0.000771}
[03/28 13:02:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "630", "eta": "2:43:10", "loss": 0.079119, "lr": 0.031077, "mode": "train", "time_backward": 1.118962, "time_data": 0.017343, "time_diff": 1.714266, "time_forward": 0.571930, "time_loss": 0.000480}
[03/28 13:02:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "640", "eta": "2:42:53", "loss": 0.073690, "lr": 0.031093, "mode": "train", "time_backward": 1.082022, "time_data": 0.017035, "time_diff": 1.526929, "time_forward": 0.398099, "time_loss": 0.000262}
[03/28 13:03:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "650", "eta": "2:42:35", "loss": 0.069507, "lr": 0.031110, "mode": "train", "time_backward": 1.097681, "time_data": 0.016640, "time_diff": 1.519434, "time_forward": 0.398317, "time_loss": 0.000238}
[03/28 13:03:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "660", "eta": "2:44:06", "loss": 0.068252, "lr": 0.031126, "mode": "train", "time_backward": 11.086789, "time_data": 0.017010, "time_diff": 11.511868, "time_forward": 0.401261, "time_loss": 0.000278}
[03/28 13:03:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "670", "eta": "2:43:49", "loss": 0.070097, "lr": 0.031142, "mode": "train", "time_backward": 1.184352, "time_data": 0.032158, "time_diff": 1.648057, "time_forward": 0.419900, "time_loss": 0.001140}
[03/28 13:04:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "680", "eta": "2:43:30", "loss": 0.073276, "lr": 0.031159, "mode": "train", "time_backward": 1.057429, "time_data": 0.016996, "time_diff": 1.478821, "time_forward": 0.398371, "time_loss": 0.000219}
[03/28 13:04:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "690", "eta": "2:42:30", "loss": 0.075021, "lr": 0.031175, "mode": "train", "time_backward": 1.095525, "time_data": 0.030322, "time_diff": 1.839081, "time_forward": 0.697818, "time_loss": 0.000444}
[03/28 13:05:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "700", "eta": "2:42:11", "loss": 0.064496, "lr": 0.031191, "mode": "train", "time_backward": 1.064091, "time_data": 0.019361, "time_diff": 1.504230, "time_forward": 0.412723, "time_loss": 0.005638}
[03/28 13:05:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "710", "eta": "2:44:40", "loss": 0.067647, "lr": 0.031207, "mode": "train", "time_backward": 1.097822, "time_data": 15.368823, "time_diff": 16.980405, "time_forward": 0.435072, "time_loss": 0.000461}
[03/28 13:05:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "720", "eta": "2:44:22", "loss": 0.077760, "lr": 0.031224, "mode": "train", "time_backward": 1.059309, "time_data": 0.028662, "time_diff": 1.516490, "time_forward": 0.420069, "time_loss": 0.000371}
[03/28 13:06:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "730", "eta": "2:44:03", "loss": 0.071546, "lr": 0.031240, "mode": "train", "time_backward": 1.066620, "time_data": 0.020594, "time_diff": 1.541401, "time_forward": 0.416455, "time_loss": 0.000382}
[03/28 13:06:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "740", "eta": "2:43:45", "loss": 0.073660, "lr": 0.031256, "mode": "train", "time_backward": 1.062350, "time_data": 0.019238, "time_diff": 1.490534, "time_forward": 0.405459, "time_loss": 0.000279}
[03/28 13:07:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "750", "eta": "2:43:26", "loss": 0.075311, "lr": 0.031273, "mode": "train", "time_backward": 1.065638, "time_data": 0.019066, "time_diff": 1.487646, "time_forward": 0.399263, "time_loss": 0.000389}
[03/28 13:07:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "760", "eta": "2:43:09", "loss": 0.074994, "lr": 0.031289, "mode": "train", "time_backward": 1.063412, "time_data": 0.024949, "time_diff": 1.550615, "time_forward": 0.401642, "time_loss": 0.000261}
[03/28 13:07:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "770", "eta": "2:42:52", "loss": 0.067594, "lr": 0.031305, "mode": "train", "time_backward": 1.052365, "time_data": 0.017130, "time_diff": 1.637897, "time_forward": 0.542937, "time_loss": 0.000289}
[03/28 13:07:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "780", "eta": "2:39:19", "loss": 0.074901, "lr": 0.031322, "mode": "train", "time_backward": 1.222998, "time_data": 0.019080, "time_diff": 1.657901, "time_forward": 0.404727, "time_loss": 0.000354}
[03/28 13:08:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "790", "eta": "2:39:03", "loss": 0.069965, "lr": 0.031338, "mode": "train", "time_backward": 1.166958, "time_data": 0.017234, "time_diff": 1.624640, "time_forward": 0.398121, "time_loss": 0.000243}
[03/28 13:08:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "800", "eta": "2:38:45", "loss": 0.069377, "lr": 0.031354, "mode": "train", "time_backward": 1.100840, "time_data": 0.017244, "time_diff": 1.525513, "time_forward": 0.399937, "time_loss": 0.000391}
[03/28 13:08:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "810", "eta": "2:38:27", "loss": 0.073159, "lr": 0.031371, "mode": "train", "time_backward": 1.056332, "time_data": 0.017112, "time_diff": 1.482644, "time_forward": 0.400013, "time_loss": 0.000378}
[03/28 13:08:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "820", "eta": "2:38:14", "loss": 0.067264, "lr": 0.031387, "mode": "train", "time_backward": 1.467089, "time_data": 0.025141, "time_diff": 1.993937, "time_forward": 0.408083, "time_loss": 0.000526}
[03/28 13:09:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "830", "eta": "2:37:56", "loss": 0.079818, "lr": 0.031403, "mode": "train", "time_backward": 1.087744, "time_data": 0.020613, "time_diff": 1.525420, "time_forward": 0.407843, "time_loss": 0.000235}
[03/28 13:09:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "840", "eta": "2:37:38", "loss": 0.072539, "lr": 0.031420, "mode": "train", "time_backward": 1.068400, "time_data": 0.018244, "time_diff": 1.520315, "time_forward": 0.430377, "time_loss": 0.000348}
[03/28 13:09:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "850", "eta": "2:37:22", "loss": 0.075695, "lr": 0.031436, "mode": "train", "time_backward": 1.109612, "time_data": 0.017619, "time_diff": 1.732266, "time_forward": 0.411553, "time_loss": 0.006716}
[03/28 13:10:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "860", "eta": "2:37:04", "loss": 0.073919, "lr": 0.031452, "mode": "train", "time_backward": 1.069729, "time_data": 0.017763, "time_diff": 1.514712, "time_forward": 0.400443, "time_loss": 0.000329}
[03/28 13:10:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "870", "eta": "2:36:47", "loss": 0.070479, "lr": 0.031469, "mode": "train", "time_backward": 1.081173, "time_data": 0.016918, "time_diff": 1.634429, "time_forward": 0.402299, "time_loss": 0.001475}
[03/28 13:10:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "880", "eta": "2:36:29", "loss": 0.069070, "lr": 0.031485, "mode": "train", "time_backward": 1.082992, "time_data": 0.018317, "time_diff": 1.510898, "time_forward": 0.400464, "time_loss": 0.000322}
[03/28 13:11:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "890", "eta": "2:36:04", "loss": 0.072419, "lr": 0.031501, "mode": "train", "time_backward": 1.161659, "time_data": 0.024725, "time_diff": 1.604532, "time_forward": 0.411668, "time_loss": 0.000341}
[03/28 13:11:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "900", "eta": "2:35:46", "loss": 0.074208, "lr": 0.031518, "mode": "train", "time_backward": 1.061134, "time_data": 0.016912, "time_diff": 1.484285, "time_forward": 0.399054, "time_loss": 0.000385}
[03/28 13:12:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "910", "eta": "2:35:28", "loss": 0.075947, "lr": 0.031534, "mode": "train", "time_backward": 1.056580, "time_data": 0.017400, "time_diff": 1.484248, "time_forward": 0.400564, "time_loss": 0.000272}
[03/28 13:12:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "920", "eta": "2:35:10", "loss": 0.073824, "lr": 0.031550, "mode": "train", "time_backward": 1.059328, "time_data": 0.033119, "time_diff": 1.548140, "time_forward": 0.451624, "time_loss": 0.000761}
[03/28 13:12:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "930", "eta": "2:34:53", "loss": 0.069824, "lr": 0.031567, "mode": "train", "time_backward": 1.102137, "time_data": 0.017487, "time_diff": 1.525873, "time_forward": 0.400044, "time_loss": 0.000324}
[03/28 13:13:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "940", "eta": "2:34:41", "loss": 0.072762, "lr": 0.031583, "mode": "train", "time_backward": 1.131389, "time_data": 0.016970, "time_diff": 2.023405, "time_forward": 0.866903, "time_loss": 0.000271}
[03/28 13:13:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "950", "eta": "2:34:39", "loss": 0.069815, "lr": 0.031599, "mode": "train", "time_backward": 1.053552, "time_data": 0.019167, "time_diff": 3.099730, "time_forward": 0.398795, "time_loss": 0.000259}
[03/28 13:14:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "960", "eta": "2:34:22", "loss": 0.072082, "lr": 0.031616, "mode": "train", "time_backward": 1.066796, "time_data": 0.024034, "time_diff": 1.529259, "time_forward": 0.417981, "time_loss": 0.000285}
[03/28 13:14:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "970", "eta": "2:35:10", "loss": 0.070944, "lr": 0.031632, "mode": "train", "time_backward": 7.032023, "time_data": 0.017417, "time_diff": 7.920615, "time_forward": 0.399279, "time_loss": 0.000286}
[03/28 13:14:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "980", "eta": "2:34:53", "loss": 0.072182, "lr": 0.031648, "mode": "train", "time_backward": 1.148368, "time_data": 0.019570, "time_diff": 1.589301, "time_forward": 0.413915, "time_loss": 0.000378}
[03/28 13:15:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "990", "eta": "2:34:34", "loss": 0.069251, "lr": 0.031665, "mode": "train", "time_backward": 1.056559, "time_data": 0.017068, "time_diff": 1.482056, "time_forward": 0.398649, "time_loss": 0.000237}
[03/28 13:15:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1000", "eta": "2:34:18", "loss": 0.073359, "lr": 0.031681, "mode": "train", "time_backward": 1.173983, "time_data": 0.017285, "time_diff": 1.631566, "time_forward": 0.437188, "time_loss": 0.000276}
[03/28 13:15:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1010", "eta": "2:34:00", "loss": 0.077489, "lr": 0.031697, "mode": "train", "time_backward": 1.056934, "time_data": 0.063186, "time_diff": 1.524528, "time_forward": 0.400845, "time_loss": 0.000325}
[03/28 13:16:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1020", "eta": "2:33:43", "loss": 0.065054, "lr": 0.031714, "mode": "train", "time_backward": 1.149836, "time_data": 0.018819, "time_diff": 1.582392, "time_forward": 0.399213, "time_loss": 0.000257}
[03/28 13:16:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1030", "eta": "2:33:24", "loss": 0.066166, "lr": 0.031730, "mode": "train", "time_backward": 1.075991, "time_data": 0.027844, "time_diff": 1.535215, "time_forward": 0.409474, "time_loss": 0.000243}
[03/28 13:16:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1040", "eta": "2:33:10", "loss": 0.075930, "lr": 0.031746, "mode": "train", "time_backward": 1.370739, "time_data": 0.017532, "time_diff": 1.814113, "time_forward": 0.400579, "time_loss": 0.000980}
[03/28 13:17:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1050", "eta": "2:34:50", "loss": 0.075945, "lr": 0.031763, "mode": "train", "time_backward": 12.778037, "time_data": 0.030948, "time_diff": 13.272897, "time_forward": 0.433227, "time_loss": 0.000461}
[03/28 13:17:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1060", "eta": "2:34:32", "loss": 0.070251, "lr": 0.031779, "mode": "train", "time_backward": 1.059127, "time_data": 0.017409, "time_diff": 1.481742, "time_forward": 0.401336, "time_loss": 0.000578}
[03/28 13:18:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1070", "eta": "2:34:13", "loss": 0.065739, "lr": 0.031795, "mode": "train", "time_backward": 1.063052, "time_data": 0.017243, "time_diff": 1.485320, "time_forward": 0.402194, "time_loss": 0.000322}
[03/28 13:18:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1080", "eta": "2:33:55", "loss": 0.073579, "lr": 0.031812, "mode": "train", "time_backward": 1.065090, "time_data": 0.016994, "time_diff": 1.485195, "time_forward": 0.399453, "time_loss": 0.000456}
[03/28 13:18:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1090", "eta": "2:33:36", "loss": 0.075410, "lr": 0.031828, "mode": "train", "time_backward": 1.071437, "time_data": 0.017368, "time_diff": 1.492839, "time_forward": 0.399050, "time_loss": 0.000260}
[03/28 13:19:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1100", "eta": "2:33:18", "loss": 0.079081, "lr": 0.031844, "mode": "train", "time_backward": 1.059849, "time_data": 0.017117, "time_diff": 1.482453, "time_forward": 0.398797, "time_loss": 0.000263}
[03/28 13:19:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1110", "eta": "2:33:00", "loss": 0.068968, "lr": 0.031861, "mode": "train", "time_backward": 1.096969, "time_data": 0.019992, "time_diff": 1.544450, "time_forward": 0.426972, "time_loss": 0.000245}
[03/28 13:19:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1120", "eta": "2:31:46", "loss": 0.073650, "lr": 0.031877, "mode": "train", "time_backward": 1.065995, "time_data": 0.017531, "time_diff": 1.490966, "time_forward": 0.403645, "time_loss": 0.000387}
[03/28 13:20:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1130", "eta": "2:33:30", "loss": 0.070271, "lr": 0.031893, "mode": "train", "time_backward": 13.317526, "time_data": 0.019206, "time_diff": 13.743718, "time_forward": 0.399968, "time_loss": 0.000351}
[03/28 13:20:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1140", "eta": "2:33:11", "loss": 0.073778, "lr": 0.031909, "mode": "train", "time_backward": 1.074982, "time_data": 0.016883, "time_diff": 1.501468, "time_forward": 0.398196, "time_loss": 0.000275}
[03/28 13:21:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1150", "eta": "2:32:53", "loss": 0.074108, "lr": 0.031926, "mode": "train", "time_backward": 1.056861, "time_data": 0.017311, "time_diff": 1.557840, "time_forward": 0.475373, "time_loss": 0.000280}
[03/28 13:21:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1160", "eta": "2:32:34", "loss": 0.068657, "lr": 0.031942, "mode": "train", "time_backward": 1.089251, "time_data": 0.019616, "time_diff": 1.516465, "time_forward": 0.399731, "time_loss": 0.000328}
[03/28 13:22:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1170", "eta": "2:32:15", "loss": 0.070640, "lr": 0.031958, "mode": "train", "time_backward": 1.059144, "time_data": 0.018144, "time_diff": 1.500418, "time_forward": 0.402898, "time_loss": 0.000418}
[03/28 13:22:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1180", "eta": "2:31:57", "loss": 0.068791, "lr": 0.031975, "mode": "train", "time_backward": 1.098461, "time_data": 0.017138, "time_diff": 1.520134, "time_forward": 0.401063, "time_loss": 0.000232}
[03/28 13:22:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1190", "eta": "2:31:38", "loss": 0.075350, "lr": 0.031991, "mode": "train", "time_backward": 1.055625, "time_data": 0.018717, "time_diff": 1.479214, "time_forward": 0.399779, "time_loss": 0.000366}
[03/28 13:23:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1200", "eta": "2:31:19", "loss": 0.066266, "lr": 0.032007, "mode": "train", "time_backward": 1.068215, "time_data": 0.016839, "time_diff": 1.490143, "time_forward": 0.399823, "time_loss": 0.000276}
[03/28 13:23:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1210", "eta": "2:31:00", "loss": 0.071651, "lr": 0.032024, "mode": "train", "time_backward": 1.068365, "time_data": 0.019892, "time_diff": 1.518176, "time_forward": 0.429121, "time_loss": 0.000404}
[03/28 13:23:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1220", "eta": "2:30:42", "loss": 0.071457, "lr": 0.032040, "mode": "train", "time_backward": 1.076168, "time_data": 0.026355, "time_diff": 1.511421, "time_forward": 0.398008, "time_loss": 0.000234}
[03/28 13:23:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1230", "eta": "2:30:24", "loss": 0.073174, "lr": 0.032056, "mode": "train", "time_backward": 1.071567, "time_data": 0.016864, "time_diff": 1.589372, "time_forward": 0.500364, "time_loss": 0.000255}
[03/28 13:24:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1240", "eta": "2:30:06", "loss": 0.071849, "lr": 0.032073, "mode": "train", "time_backward": 1.075125, "time_data": 0.017372, "time_diff": 1.522731, "time_forward": 0.400479, "time_loss": 0.000724}
[03/28 13:24:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1250", "eta": "2:29:47", "loss": 0.076103, "lr": 0.032089, "mode": "train", "time_backward": 1.054992, "time_data": 0.017665, "time_diff": 1.482699, "time_forward": 0.400751, "time_loss": 0.000343}
[03/28 13:24:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1260", "eta": "2:29:28", "loss": 0.066907, "lr": 0.032105, "mode": "train", "time_backward": 1.063795, "time_data": 0.018129, "time_diff": 1.484566, "time_forward": 0.399703, "time_loss": 0.000263}
[03/28 13:25:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1270", "eta": "2:29:10", "loss": 0.074434, "lr": 0.032122, "mode": "train", "time_backward": 1.102277, "time_data": 0.019922, "time_diff": 1.533597, "time_forward": 0.402014, "time_loss": 0.000403}
[03/28 13:26:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1280", "eta": "2:28:52", "loss": 0.064094, "lr": 0.032138, "mode": "train", "time_backward": 1.056559, "time_data": 0.017210, "time_diff": 1.528917, "time_forward": 0.451531, "time_loss": 0.000396}
[03/28 13:26:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1290", "eta": "2:28:33", "loss": 0.070271, "lr": 0.032154, "mode": "train", "time_backward": 1.056728, "time_data": 0.017415, "time_diff": 1.480330, "time_forward": 0.399787, "time_loss": 0.000427}
[03/28 13:26:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1300", "eta": "2:28:20", "loss": 0.070466, "lr": 0.032171, "mode": "train", "time_backward": 1.600669, "time_data": 0.019976, "time_diff": 2.109276, "time_forward": 0.433854, "time_loss": 0.000424}
[03/28 13:27:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1310", "eta": "2:28:02", "loss": 0.068036, "lr": 0.032187, "mode": "train", "time_backward": 1.103231, "time_data": 0.017120, "time_diff": 1.528047, "time_forward": 0.399149, "time_loss": 0.000369}
[03/28 13:27:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1320", "eta": "2:27:44", "loss": 0.068389, "lr": 0.032203, "mode": "train", "time_backward": 1.135884, "time_data": 0.018980, "time_diff": 1.563056, "time_forward": 0.398609, "time_loss": 0.000236}
[03/28 13:27:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1330", "eta": "2:27:25", "loss": 0.074390, "lr": 0.032220, "mode": "train", "time_backward": 1.056436, "time_data": 0.016847, "time_diff": 1.491743, "time_forward": 0.398492, "time_loss": 0.000355}
[03/28 13:28:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1340", "eta": "2:27:07", "loss": 0.067884, "lr": 0.032236, "mode": "train", "time_backward": 1.056263, "time_data": 0.019276, "time_diff": 1.480630, "time_forward": 0.398735, "time_loss": 0.000221}
[03/28 13:29:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1350", "eta": "2:35:13", "loss": 0.070064, "lr": 0.032252, "mode": "train", "time_backward": 54.124249, "time_data": 0.017389, "time_diff": 54.548518, "time_forward": 0.400983, "time_loss": 0.000368}
[03/28 13:30:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1360", "eta": "2:34:54", "loss": 0.072860, "lr": 0.032269, "mode": "train", "time_backward": 1.057915, "time_data": 0.017301, "time_diff": 1.484565, "time_forward": 0.400182, "time_loss": 0.000338}
[03/28 13:30:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1370", "eta": "2:34:34", "loss": 0.076288, "lr": 0.032285, "mode": "train", "time_backward": 1.178360, "time_data": 0.019032, "time_diff": 1.601583, "time_forward": 0.401484, "time_loss": 0.000268}
[03/28 13:30:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1380", "eta": "2:34:14", "loss": 0.068431, "lr": 0.032301, "mode": "train", "time_backward": 1.077266, "time_data": 0.017330, "time_diff": 1.507482, "time_forward": 0.409297, "time_loss": 0.000269}
[03/28 13:31:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1390", "eta": "2:33:57", "loss": 0.069660, "lr": 0.032318, "mode": "train", "time_backward": 1.142235, "time_data": 0.063741, "time_diff": 1.771622, "time_forward": 0.539019, "time_loss": 0.001447}
[03/28 13:31:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1400", "eta": "2:33:38", "loss": 0.067188, "lr": 0.032334, "mode": "train", "time_backward": 1.096860, "time_data": 0.017618, "time_diff": 1.522854, "time_forward": 0.401755, "time_loss": 0.000260}
[03/28 13:31:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1410", "eta": "2:33:18", "loss": 0.074901, "lr": 0.032350, "mode": "train", "time_backward": 1.057066, "time_data": 0.017408, "time_diff": 1.482330, "time_forward": 0.399063, "time_loss": 0.000266}
[03/28 13:32:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1420", "eta": "2:32:58", "loss": 0.071104, "lr": 0.032367, "mode": "train", "time_backward": 1.063978, "time_data": 0.020135, "time_diff": 1.556991, "time_forward": 0.416203, "time_loss": 0.001259}
[03/28 13:32:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1430", "eta": "2:32:39", "loss": 0.071708, "lr": 0.032383, "mode": "train", "time_backward": 1.056489, "time_data": 0.017366, "time_diff": 1.528946, "time_forward": 0.446626, "time_loss": 0.000614}
[03/28 13:32:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1440", "eta": "2:32:19", "loss": 0.067713, "lr": 0.032399, "mode": "train", "time_backward": 1.062294, "time_data": 0.017160, "time_diff": 1.485453, "time_forward": 0.399321, "time_loss": 0.000331}
[03/28 13:33:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1450", "eta": "2:31:59", "loss": 0.069990, "lr": 0.032416, "mode": "train", "time_backward": 1.067409, "time_data": 0.016908, "time_diff": 1.498888, "time_forward": 0.398587, "time_loss": 0.000265}
[03/28 13:34:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1460", "eta": "2:31:39", "loss": 0.076467, "lr": 0.032432, "mode": "train", "time_backward": 1.067563, "time_data": 0.017713, "time_diff": 1.498444, "time_forward": 0.399744, "time_loss": 0.000262}
[03/28 13:34:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1470", "eta": "2:31:19", "loss": 0.068177, "lr": 0.032448, "mode": "train", "time_backward": 1.056157, "time_data": 0.017030, "time_diff": 1.482808, "time_forward": 0.401952, "time_loss": 0.000376}
[03/28 13:34:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1480", "eta": "2:30:56", "loss": 0.069247, "lr": 0.032465, "mode": "train", "time_backward": 1.119580, "time_data": 0.018821, "time_diff": 1.545845, "time_forward": 0.399647, "time_loss": 0.000301}
[03/28 13:34:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1490", "eta": "2:30:36", "loss": 0.068226, "lr": 0.032481, "mode": "train", "time_backward": 1.056287, "time_data": 0.018342, "time_diff": 1.482220, "time_forward": 0.404072, "time_loss": 0.000360}
[03/28 13:35:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1500", "eta": "2:30:16", "loss": 0.066333, "lr": 0.032497, "mode": "train", "time_backward": 1.066458, "time_data": 0.019132, "time_diff": 1.496486, "time_forward": 0.398999, "time_loss": 0.000354}
[03/28 13:35:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1510", "eta": "2:29:56", "loss": 0.068454, "lr": 0.032514, "mode": "train", "time_backward": 1.055836, "time_data": 0.019951, "time_diff": 1.483980, "time_forward": 0.399501, "time_loss": 0.000242}
[03/28 13:35:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1520", "eta": "2:29:37", "loss": 0.070722, "lr": 0.032530, "mode": "train", "time_backward": 1.066540, "time_data": 0.024876, "time_diff": 1.570000, "time_forward": 0.474968, "time_loss": 0.000301}
[03/28 13:36:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1530", "eta": "2:29:18", "loss": 0.068227, "lr": 0.032546, "mode": "train", "time_backward": 1.113614, "time_data": 0.016976, "time_diff": 1.572915, "time_forward": 0.399002, "time_loss": 0.000242}
[03/28 13:36:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1540", "eta": "2:29:02", "loss": 0.067511, "lr": 0.032563, "mode": "train", "time_backward": 1.117417, "time_data": 0.031829, "time_diff": 1.845181, "time_forward": 0.682152, "time_loss": 0.003647}
[03/28 13:37:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1550", "eta": "2:28:42", "loss": 0.074806, "lr": 0.032579, "mode": "train", "time_backward": 1.095758, "time_data": 0.022813, "time_diff": 1.533306, "time_forward": 0.407681, "time_loss": 0.000403}
[03/28 13:37:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1560", "eta": "2:28:21", "loss": 0.064783, "lr": 0.032595, "mode": "train", "time_backward": 1.079503, "time_data": 0.017680, "time_diff": 1.504808, "time_forward": 0.404237, "time_loss": 0.000597}
[03/28 13:37:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1570", "eta": "2:28:02", "loss": 0.067197, "lr": 0.032612, "mode": "train", "time_backward": 1.063720, "time_data": 0.018745, "time_diff": 1.580536, "time_forward": 0.404368, "time_loss": 0.000374}
[03/28 13:37:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1580", "eta": "2:27:42", "loss": 0.068788, "lr": 0.032628, "mode": "train", "time_backward": 1.058143, "time_data": 0.017741, "time_diff": 1.482100, "time_forward": 0.399594, "time_loss": 0.000266}
[03/28 13:38:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1590", "eta": "2:27:59", "loss": 0.068297, "lr": 0.032644, "mode": "train", "time_backward": 5.195825, "time_data": 0.016555, "time_diff": 5.655574, "time_forward": 0.400319, "time_loss": 0.000377}
[03/28 13:38:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1600", "eta": "2:27:39", "loss": 0.071207, "lr": 0.032660, "mode": "train", "time_backward": 1.069984, "time_data": 0.017009, "time_diff": 1.491240, "time_forward": 0.400789, "time_loss": 0.000381}
[03/28 13:38:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1610", "eta": "2:27:18", "loss": 0.069041, "lr": 0.032677, "mode": "train", "time_backward": 1.058168, "time_data": 0.017846, "time_diff": 1.483615, "time_forward": 0.400333, "time_loss": 0.000385}
[03/28 13:39:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1620", "eta": "2:26:59", "loss": 0.072114, "lr": 0.032693, "mode": "train", "time_backward": 1.055322, "time_data": 0.019023, "time_diff": 1.514255, "time_forward": 0.436277, "time_loss": 0.000274}
[03/28 13:40:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1630", "eta": "2:26:38", "loss": 0.067755, "lr": 0.032709, "mode": "train", "time_backward": 1.057117, "time_data": 0.024286, "time_diff": 1.499358, "time_forward": 0.401056, "time_loss": 0.000363}
[03/28 13:40:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1640", "eta": "2:26:16", "loss": 0.064558, "lr": 0.032726, "mode": "train", "time_backward": 1.090845, "time_data": 0.017023, "time_diff": 1.516709, "time_forward": 0.400833, "time_loss": 0.000422}
[03/28 13:40:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1650", "eta": "2:26:39", "loss": 0.064714, "lr": 0.032742, "mode": "train", "time_backward": 5.750524, "time_data": 0.046803, "time_diff": 6.213787, "time_forward": 0.415128, "time_loss": 0.000369}
[03/28 13:41:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1660", "eta": "2:26:19", "loss": 0.074417, "lr": 0.032758, "mode": "train", "time_backward": 1.097663, "time_data": 0.024001, "time_diff": 1.527860, "time_forward": 0.400074, "time_loss": 0.000271}
[03/28 13:41:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1670", "eta": "2:25:59", "loss": 0.068285, "lr": 0.032775, "mode": "train", "time_backward": 1.066948, "time_data": 0.017498, "time_diff": 1.487580, "time_forward": 0.399111, "time_loss": 0.000253}
[03/28 13:41:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1680", "eta": "2:25:39", "loss": 0.068150, "lr": 0.032791, "mode": "train", "time_backward": 1.068840, "time_data": 0.018550, "time_diff": 1.496435, "time_forward": 0.401859, "time_loss": 0.000323}
[03/28 13:42:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1690", "eta": "2:25:20", "loss": 0.066063, "lr": 0.032807, "mode": "train", "time_backward": 1.099035, "time_data": 0.017741, "time_diff": 1.524963, "time_forward": 0.401780, "time_loss": 0.000416}
[03/28 13:42:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1700", "eta": "2:23:21", "loss": 0.075568, "lr": 0.032824, "mode": "train", "time_backward": 1.057949, "time_data": 0.017401, "time_diff": 1.480519, "time_forward": 0.399015, "time_loss": 0.000247}
[03/28 13:43:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1710", "eta": "2:23:02", "loss": 0.071494, "lr": 0.032840, "mode": "train", "time_backward": 1.054808, "time_data": 0.020567, "time_diff": 1.485610, "time_forward": 0.402487, "time_loss": 0.000267}
[03/28 13:43:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1720", "eta": "2:22:42", "loss": 0.068318, "lr": 0.032856, "mode": "train", "time_backward": 1.109212, "time_data": 0.016906, "time_diff": 1.532544, "time_forward": 0.399594, "time_loss": 0.000371}
[03/28 13:43:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1730", "eta": "2:22:23", "loss": 0.068650, "lr": 0.032873, "mode": "train", "time_backward": 1.068796, "time_data": 0.017881, "time_diff": 1.489974, "time_forward": 0.399823, "time_loss": 0.000281}
[03/28 13:44:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1740", "eta": "2:22:02", "loss": 0.070338, "lr": 0.032889, "mode": "train", "time_backward": 1.064154, "time_data": 0.016933, "time_diff": 1.484905, "time_forward": 0.400050, "time_loss": 0.000413}
[03/28 13:44:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1750", "eta": "2:21:43", "loss": 0.060385, "lr": 0.032905, "mode": "train", "time_backward": 1.082293, "time_data": 0.018492, "time_diff": 1.527961, "time_forward": 0.417024, "time_loss": 0.000401}
[03/28 13:44:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1760", "eta": "2:21:23", "loss": 0.066021, "lr": 0.032922, "mode": "train", "time_backward": 1.054608, "time_data": 0.018320, "time_diff": 1.477289, "time_forward": 0.399951, "time_loss": 0.000516}
[03/28 13:45:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1770", "eta": "2:25:39", "loss": 0.070146, "lr": 0.032938, "mode": "train", "time_backward": 32.865069, "time_data": 0.017442, "time_diff": 33.287124, "time_forward": 0.397917, "time_loss": 0.000259}
[03/28 13:46:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1780", "eta": "2:25:20", "loss": 0.072392, "lr": 0.032954, "mode": "train", "time_backward": 1.154170, "time_data": 0.022071, "time_diff": 1.633103, "time_forward": 0.406094, "time_loss": 0.000285}
[03/28 13:46:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1790", "eta": "2:25:00", "loss": 0.066118, "lr": 0.032971, "mode": "train", "time_backward": 1.105486, "time_data": 0.044576, "time_diff": 1.589686, "time_forward": 0.424676, "time_loss": 0.000273}
[03/28 13:46:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1800", "eta": "2:24:40", "loss": 0.073798, "lr": 0.032987, "mode": "train", "time_backward": 1.055497, "time_data": 0.017636, "time_diff": 1.478527, "time_forward": 0.399888, "time_loss": 0.000387}
[03/28 13:47:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1810", "eta": "2:24:19", "loss": 0.067017, "lr": 0.033003, "mode": "train", "time_backward": 1.079183, "time_data": 0.018053, "time_diff": 1.505562, "time_forward": 0.399470, "time_loss": 0.000356}
[03/28 13:47:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1820", "eta": "2:24:35", "loss": 0.064825, "lr": 0.033020, "mode": "train", "time_backward": 5.354733, "time_data": 0.016889, "time_diff": 5.779056, "time_forward": 0.400542, "time_loss": 0.000371}
[03/28 13:47:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1830", "eta": "2:24:15", "loss": 0.078177, "lr": 0.033036, "mode": "train", "time_backward": 1.057209, "time_data": 0.017138, "time_diff": 1.484817, "time_forward": 0.399656, "time_loss": 0.000338}
[03/28 13:48:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1840", "eta": "2:24:01", "loss": 0.064436, "lr": 0.033052, "mode": "train", "time_backward": 1.162125, "time_data": 0.024083, "time_diff": 2.392284, "time_forward": 1.168564, "time_loss": 0.000315}
[03/28 13:48:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1850", "eta": "2:23:40", "loss": 0.061850, "lr": 0.033069, "mode": "train", "time_backward": 1.107257, "time_data": 0.017871, "time_diff": 1.533624, "time_forward": 0.403781, "time_loss": 0.000499}
[03/28 13:48:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1860", "eta": "2:23:20", "loss": 0.066513, "lr": 0.033085, "mode": "train", "time_backward": 1.057752, "time_data": 0.018926, "time_diff": 1.486231, "time_forward": 0.400976, "time_loss": 0.000334}
[03/28 13:49:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1870", "eta": "2:22:59", "loss": 0.067451, "lr": 0.033101, "mode": "train", "time_backward": 1.060169, "time_data": 0.017829, "time_diff": 1.486765, "time_forward": 0.401469, "time_loss": 0.000379}
[03/28 13:50:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1880", "eta": "2:22:39", "loss": 0.064311, "lr": 0.033118, "mode": "train", "time_backward": 1.059084, "time_data": 0.017162, "time_diff": 1.481429, "time_forward": 0.400092, "time_loss": 0.000370}
[03/28 13:50:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1890", "eta": "2:22:18", "loss": 0.068424, "lr": 0.033134, "mode": "train", "time_backward": 1.055207, "time_data": 0.016983, "time_diff": 1.474994, "time_forward": 0.399070, "time_loss": 0.000329}
[03/28 13:50:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1900", "eta": "2:21:58", "loss": 0.062197, "lr": 0.033150, "mode": "train", "time_backward": 1.068144, "time_data": 0.018533, "time_diff": 1.498919, "time_forward": 0.399609, "time_loss": 0.000412}
[03/28 13:51:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1910", "eta": "2:21:38", "loss": 0.075043, "lr": 0.033167, "mode": "train", "time_backward": 1.101154, "time_data": 0.018778, "time_diff": 1.540848, "time_forward": 0.399754, "time_loss": 0.000284}
[03/28 13:51:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1920", "eta": "2:22:32", "loss": 0.071814, "lr": 0.033183, "mode": "train", "time_backward": 1.057773, "time_data": 8.858579, "time_diff": 10.331501, "time_forward": 0.411658, "time_loss": 0.000285}
[03/28 13:52:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1930", "eta": "2:22:12", "loss": 0.065400, "lr": 0.033199, "mode": "train", "time_backward": 1.081652, "time_data": 0.021038, "time_diff": 1.507433, "time_forward": 0.401521, "time_loss": 0.000285}
[03/28 13:52:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1940", "eta": "2:21:51", "loss": 0.073712, "lr": 0.033216, "mode": "train", "time_backward": 1.059744, "time_data": 0.020075, "time_diff": 1.500796, "time_forward": 0.416776, "time_loss": 0.000469}
[03/28 13:52:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1950", "eta": "2:21:29", "loss": 0.073224, "lr": 0.033232, "mode": "train", "time_backward": 1.055993, "time_data": 0.021331, "time_diff": 1.481058, "time_forward": 0.399616, "time_loss": 0.000392}
[03/28 13:52:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1960", "eta": "2:21:10", "loss": 0.064553, "lr": 0.033248, "mode": "train", "time_backward": 1.142626, "time_data": 0.030994, "time_diff": 1.611095, "time_forward": 0.430149, "time_loss": 0.000269}
[03/28 13:53:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1970", "eta": "2:20:49", "loss": 0.067098, "lr": 0.033265, "mode": "train", "time_backward": 1.084316, "time_data": 0.024084, "time_diff": 1.533946, "time_forward": 0.418111, "time_loss": 0.000308}
[03/28 13:53:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1980", "eta": "2:20:29", "loss": 0.068555, "lr": 0.033281, "mode": "train", "time_backward": 1.056898, "time_data": 0.017018, "time_diff": 1.484126, "time_forward": 0.406989, "time_loss": 0.000370}
[03/28 13:54:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "1990", "eta": "2:20:08", "loss": 0.066796, "lr": 0.033297, "mode": "train", "time_backward": 1.056378, "time_data": 0.019433, "time_diff": 1.483117, "time_forward": 0.402788, "time_loss": 0.001061}
[03/28 13:55:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2000", "eta": "2:19:47", "loss": 0.068249, "lr": 0.033314, "mode": "train", "time_backward": 1.057076, "time_data": 0.018650, "time_diff": 1.497793, "time_forward": 0.413042, "time_loss": 0.000390}
[03/28 13:55:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2010", "eta": "2:19:27", "loss": 0.068273, "lr": 0.033330, "mode": "train", "time_backward": 1.097938, "time_data": 0.020914, "time_diff": 1.526965, "time_forward": 0.405356, "time_loss": 0.000431}
[03/28 13:55:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2020", "eta": "2:18:36", "loss": 0.068255, "lr": 0.033346, "mode": "train", "time_backward": 1.067433, "time_data": 0.019843, "time_diff": 1.492538, "time_forward": 0.401720, "time_loss": 0.000354}
[03/28 13:55:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2030", "eta": "2:18:16", "loss": 0.067038, "lr": 0.033362, "mode": "train", "time_backward": 1.138781, "time_data": 0.022951, "time_diff": 1.599722, "time_forward": 0.432000, "time_loss": 0.000880}
[03/28 13:56:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2040", "eta": "2:17:56", "loss": 0.072438, "lr": 0.033379, "mode": "train", "time_backward": 1.063603, "time_data": 0.017345, "time_diff": 1.575003, "time_forward": 0.401059, "time_loss": 0.000371}
[03/28 13:56:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2050", "eta": "2:17:33", "loss": 0.067958, "lr": 0.033395, "mode": "train", "time_backward": 1.064741, "time_data": 0.021021, "time_diff": 1.538164, "time_forward": 0.448229, "time_loss": 0.000440}
[03/28 13:56:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2060", "eta": "2:17:15", "loss": 0.069909, "lr": 0.033411, "mode": "train", "time_backward": 1.111481, "time_data": 0.016917, "time_diff": 1.737578, "time_forward": 0.408635, "time_loss": 0.001177}
[03/28 13:57:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2070", "eta": "2:16:54", "loss": 0.067189, "lr": 0.033428, "mode": "train", "time_backward": 1.113370, "time_data": 0.017378, "time_diff": 1.539727, "time_forward": 0.399866, "time_loss": 0.000350}
[03/28 13:57:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2080", "eta": "2:16:33", "loss": 0.065101, "lr": 0.033444, "mode": "train", "time_backward": 1.060172, "time_data": 0.017489, "time_diff": 1.481776, "time_forward": 0.400504, "time_loss": 0.000335}
[03/28 13:58:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2090", "eta": "2:16:02", "loss": 0.067678, "lr": 0.033460, "mode": "train", "time_backward": 1.057448, "time_data": 0.016804, "time_diff": 1.479421, "time_forward": 0.399129, "time_loss": 0.000240}
[03/28 13:58:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2100", "eta": "2:15:38", "loss": 0.070711, "lr": 0.033477, "mode": "train", "time_backward": 1.059604, "time_data": 0.016818, "time_diff": 1.482944, "time_forward": 0.399391, "time_loss": 0.000258}
[03/28 13:59:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2110", "eta": "2:15:33", "loss": 0.075089, "lr": 0.033493, "mode": "train", "time_backward": 3.037513, "time_data": 0.017085, "time_diff": 3.461430, "time_forward": 0.401044, "time_loss": 0.000267}
[03/28 13:59:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2120", "eta": "2:15:13", "loss": 0.076076, "lr": 0.033509, "mode": "train", "time_backward": 1.095426, "time_data": 0.017043, "time_diff": 1.520860, "time_forward": 0.401305, "time_loss": 0.000351}
[03/28 13:59:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2130", "eta": "2:14:52", "loss": 0.064733, "lr": 0.033526, "mode": "train", "time_backward": 1.059218, "time_data": 0.017086, "time_diff": 1.480866, "time_forward": 0.400826, "time_loss": 0.000467}
[03/28 14:00:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2140", "eta": "2:14:32", "loss": 0.081088, "lr": 0.033542, "mode": "train", "time_backward": 1.055976, "time_data": 0.019686, "time_diff": 1.517837, "time_forward": 0.400021, "time_loss": 0.000315}
[03/28 14:00:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2150", "eta": "2:14:11", "loss": 0.069706, "lr": 0.033558, "mode": "train", "time_backward": 1.058115, "time_data": 0.017380, "time_diff": 1.482897, "time_forward": 0.401199, "time_loss": 0.000417}
[03/28 14:00:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2160", "eta": "2:13:50", "loss": 0.068591, "lr": 0.033575, "mode": "train", "time_backward": 1.067681, "time_data": 0.020283, "time_diff": 1.502226, "time_forward": 0.401000, "time_loss": 0.000393}
[03/28 14:01:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2170", "eta": "2:13:30", "loss": 0.065571, "lr": 0.033591, "mode": "train", "time_backward": 1.057980, "time_data": 0.029142, "time_diff": 1.511896, "time_forward": 0.403138, "time_loss": 0.000442}
[03/28 14:01:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2180", "eta": "2:13:09", "loss": 0.066631, "lr": 0.033607, "mode": "train", "time_backward": 1.055689, "time_data": 0.016906, "time_diff": 1.477471, "time_forward": 0.400661, "time_loss": 0.000803}
[03/28 14:01:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2190", "eta": "2:12:49", "loss": 0.062306, "lr": 0.033624, "mode": "train", "time_backward": 1.104597, "time_data": 0.017472, "time_diff": 1.595032, "time_forward": 0.402855, "time_loss": 0.000476}
[03/28 14:02:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2200", "eta": "2:12:28", "loss": 0.072584, "lr": 0.033640, "mode": "train", "time_backward": 1.080705, "time_data": 0.017137, "time_diff": 1.505353, "time_forward": 0.399645, "time_loss": 0.000276}
[03/28 14:02:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2210", "eta": "2:12:08", "loss": 0.071056, "lr": 0.033656, "mode": "train", "time_backward": 1.074760, "time_data": 0.017851, "time_diff": 1.497884, "time_forward": 0.401588, "time_loss": 0.000442}
[03/28 14:02:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2220", "eta": "2:11:47", "loss": 0.068468, "lr": 0.033673, "mode": "train", "time_backward": 1.070341, "time_data": 0.017178, "time_diff": 1.497001, "time_forward": 0.404941, "time_loss": 0.001024}
[03/28 14:03:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2230", "eta": "2:11:25", "loss": 0.061603, "lr": 0.033689, "mode": "train", "time_backward": 1.057237, "time_data": 0.017069, "time_diff": 1.483755, "time_forward": 0.401182, "time_loss": 0.000437}
[03/28 14:03:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2240", "eta": "2:11:08", "loss": 0.070274, "lr": 0.033705, "mode": "train", "time_backward": 1.431852, "time_data": 0.086752, "time_diff": 2.073749, "time_forward": 0.548749, "time_loss": 0.000412}
[03/28 14:03:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2250", "eta": "2:10:48", "loss": 0.072774, "lr": 0.033722, "mode": "train", "time_backward": 1.089081, "time_data": 0.017282, "time_diff": 1.581742, "time_forward": 0.406178, "time_loss": 0.005673}
[03/28 14:04:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2260", "eta": "2:10:26", "loss": 0.066309, "lr": 0.033738, "mode": "train", "time_backward": 1.108241, "time_data": 0.027844, "time_diff": 1.615599, "time_forward": 0.400340, "time_loss": 0.000376}
[03/28 14:04:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2270", "eta": "2:10:05", "loss": 0.072667, "lr": 0.033754, "mode": "train", "time_backward": 1.059974, "time_data": 0.019017, "time_diff": 1.481692, "time_forward": 0.399700, "time_loss": 0.000231}
[03/28 14:04:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2280", "eta": "2:09:44", "loss": 0.072436, "lr": 0.033771, "mode": "train", "time_backward": 1.055225, "time_data": 0.016860, "time_diff": 1.479272, "time_forward": 0.403520, "time_loss": 0.000374}
[03/28 14:05:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2290", "eta": "2:09:22", "loss": 0.067900, "lr": 0.033787, "mode": "train", "time_backward": 1.101808, "time_data": 0.017258, "time_diff": 1.620477, "time_forward": 0.398756, "time_loss": 0.000324}
[03/28 14:05:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2300", "eta": "2:09:02", "loss": 0.065927, "lr": 0.033803, "mode": "train", "time_backward": 1.062811, "time_data": 0.016945, "time_diff": 1.484432, "time_forward": 0.399193, "time_loss": 0.000236}
[03/28 14:05:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2310", "eta": "2:08:41", "loss": 0.064880, "lr": 0.033820, "mode": "train", "time_backward": 1.092168, "time_data": 0.020611, "time_diff": 1.569092, "time_forward": 0.447940, "time_loss": 0.000379}
[03/28 14:05:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2320", "eta": "2:08:21", "loss": 0.067796, "lr": 0.033836, "mode": "train", "time_backward": 1.093577, "time_data": 0.030459, "time_diff": 1.628294, "time_forward": 0.399675, "time_loss": 0.000320}
[03/28 14:06:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2330", "eta": "2:08:02", "loss": 0.073353, "lr": 0.033852, "mode": "train", "time_backward": 1.186403, "time_data": 0.022956, "time_diff": 1.726973, "time_forward": 0.454654, "time_loss": 0.000803}
[03/28 14:06:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2340", "eta": "2:07:42", "loss": 0.076736, "lr": 0.033869, "mode": "train", "time_backward": 1.078129, "time_data": 0.017188, "time_diff": 1.500373, "time_forward": 0.401490, "time_loss": 0.000254}
[03/28 14:06:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2350", "eta": "2:07:22", "loss": 0.064080, "lr": 0.033885, "mode": "train", "time_backward": 1.081508, "time_data": 0.017835, "time_diff": 1.500603, "time_forward": 0.400643, "time_loss": 0.000352}
[03/28 14:06:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2360", "eta": "2:07:12", "loss": 0.067509, "lr": 0.033901, "mode": "train", "time_backward": 1.452546, "time_data": 0.017195, "time_diff": 2.907206, "time_forward": 0.402157, "time_loss": 0.000425}
[03/28 14:07:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2370", "eta": "2:06:51", "loss": 0.064562, "lr": 0.033918, "mode": "train", "time_backward": 1.055250, "time_data": 0.058900, "time_diff": 1.532858, "time_forward": 0.415393, "time_loss": 0.000329}
[03/28 14:07:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2380", "eta": "2:06:30", "loss": 0.069241, "lr": 0.033934, "mode": "train", "time_backward": 1.113396, "time_data": 0.018204, "time_diff": 1.537326, "time_forward": 0.399668, "time_loss": 0.000411}
[03/28 14:07:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2390", "eta": "2:06:09", "loss": 0.067409, "lr": 0.033950, "mode": "train", "time_backward": 1.056476, "time_data": 0.017456, "time_diff": 1.478149, "time_forward": 0.399635, "time_loss": 0.000279}
[03/28 14:08:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2400", "eta": "2:05:48", "loss": 0.067045, "lr": 0.033967, "mode": "train", "time_backward": 1.055767, "time_data": 0.016960, "time_diff": 1.483863, "time_forward": 0.399616, "time_loss": 0.000330}
[03/28 14:08:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2410", "eta": "2:05:27", "loss": 0.061477, "lr": 0.033983, "mode": "train", "time_backward": 1.129921, "time_data": 0.018583, "time_diff": 1.550584, "time_forward": 0.401469, "time_loss": 0.000276}
[03/28 14:08:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2420", "eta": "2:05:07", "loss": 0.073944, "lr": 0.033999, "mode": "train", "time_backward": 1.055480, "time_data": 0.016813, "time_diff": 1.482869, "time_forward": 0.398393, "time_loss": 0.000228}
[03/28 14:09:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2430", "eta": "2:04:47", "loss": 0.072584, "lr": 0.034016, "mode": "train", "time_backward": 1.073976, "time_data": 0.018476, "time_diff": 1.549637, "time_forward": 0.444643, "time_loss": 0.000360}
[03/28 14:09:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2440", "eta": "2:04:25", "loss": 0.067439, "lr": 0.034032, "mode": "train", "time_backward": 1.060301, "time_data": 0.017311, "time_diff": 1.480530, "time_forward": 0.400131, "time_loss": 0.000255}
[03/28 14:09:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2450", "eta": "2:04:05", "loss": 0.068254, "lr": 0.034048, "mode": "train", "time_backward": 1.081730, "time_data": 0.017745, "time_diff": 1.559543, "time_forward": 0.452751, "time_loss": 0.000284}
[03/28 14:10:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2460", "eta": "2:03:45", "loss": 0.066265, "lr": 0.034064, "mode": "train", "time_backward": 1.064215, "time_data": 0.016887, "time_diff": 1.560331, "time_forward": 0.478650, "time_loss": 0.000252}
[03/28 14:10:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2470", "eta": "2:03:24", "loss": 0.059544, "lr": 0.034081, "mode": "train", "time_backward": 1.069903, "time_data": 0.017056, "time_diff": 1.514911, "time_forward": 0.399194, "time_loss": 0.000261}
[03/28 14:10:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2480", "eta": "2:03:04", "loss": 0.067607, "lr": 0.034097, "mode": "train", "time_backward": 1.068746, "time_data": 0.024373, "time_diff": 1.609193, "time_forward": 0.423369, "time_loss": 0.000390}
[03/28 14:11:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2490", "eta": "2:02:44", "loss": 0.070312, "lr": 0.034113, "mode": "train", "time_backward": 1.058106, "time_data": 0.017872, "time_diff": 1.488738, "time_forward": 0.408866, "time_loss": 0.000390}
[03/28 14:11:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2500", "eta": "2:02:23", "loss": 0.069061, "lr": 0.034130, "mode": "train", "time_backward": 1.057523, "time_data": 0.025315, "time_diff": 1.512183, "time_forward": 0.419186, "time_loss": 0.000398}
[03/28 14:11:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2510", "eta": "2:02:02", "loss": 0.078876, "lr": 0.034146, "mode": "train", "time_backward": 1.058414, "time_data": 0.017345, "time_diff": 1.523006, "time_forward": 0.443744, "time_loss": 0.000383}
[03/28 14:12:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2520", "eta": "2:01:41", "loss": 0.068223, "lr": 0.034162, "mode": "train", "time_backward": 1.064816, "time_data": 0.021973, "time_diff": 1.497651, "time_forward": 0.407230, "time_loss": 0.000342}
[03/28 14:12:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2530", "eta": "2:01:20", "loss": 0.072202, "lr": 0.034179, "mode": "train", "time_backward": 1.083680, "time_data": 0.016950, "time_diff": 1.509296, "time_forward": 0.405461, "time_loss": 0.000250}
[03/28 14:12:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2540", "eta": "2:00:59", "loss": 0.066856, "lr": 0.034195, "mode": "train", "time_backward": 1.076152, "time_data": 0.017014, "time_diff": 1.499599, "time_forward": 0.402787, "time_loss": 0.000293}
[03/28 14:13:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2550", "eta": "2:00:37", "loss": 0.068671, "lr": 0.034211, "mode": "train", "time_backward": 1.086967, "time_data": 0.030285, "time_diff": 1.568942, "time_forward": 0.444414, "time_loss": 0.000252}
[03/28 14:13:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2560", "eta": "2:00:17", "loss": 0.072146, "lr": 0.034228, "mode": "train", "time_backward": 1.056600, "time_data": 0.017249, "time_diff": 1.559365, "time_forward": 0.399627, "time_loss": 0.000399}
[03/28 14:13:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2570", "eta": "1:59:57", "loss": 0.066725, "lr": 0.034244, "mode": "train", "time_backward": 1.062886, "time_data": 0.021361, "time_diff": 1.659103, "time_forward": 0.540896, "time_loss": 0.000292}
[03/28 14:14:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2580", "eta": "1:59:36", "loss": 0.072386, "lr": 0.034260, "mode": "train", "time_backward": 1.146055, "time_data": 0.024107, "time_diff": 1.579530, "time_forward": 0.400114, "time_loss": 0.000272}
[03/28 14:14:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2590", "eta": "1:59:16", "loss": 0.066393, "lr": 0.034277, "mode": "train", "time_backward": 1.054612, "time_data": 0.017150, "time_diff": 1.478381, "time_forward": 0.399800, "time_loss": 0.000240}
[03/28 14:14:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2600", "eta": "1:58:56", "loss": 0.070093, "lr": 0.034293, "mode": "train", "time_backward": 1.104069, "time_data": 0.025399, "time_diff": 1.535535, "time_forward": 0.402481, "time_loss": 0.000257}
[03/28 14:15:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2610", "eta": "1:58:34", "loss": 0.066113, "lr": 0.034309, "mode": "train", "time_backward": 1.071785, "time_data": 0.018179, "time_diff": 1.532974, "time_forward": 0.409522, "time_loss": 0.000281}
[03/28 14:15:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2620", "eta": "1:58:14", "loss": 0.069790, "lr": 0.034326, "mode": "train", "time_backward": 1.058991, "time_data": 0.019910, "time_diff": 1.488859, "time_forward": 0.401244, "time_loss": 0.000427}
[03/28 14:15:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2630", "eta": "1:57:54", "loss": 0.074902, "lr": 0.034342, "mode": "train", "time_backward": 1.122155, "time_data": 0.025053, "time_diff": 1.555458, "time_forward": 0.401133, "time_loss": 0.000240}
[03/28 14:15:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2640", "eta": "1:57:34", "loss": 0.065583, "lr": 0.034358, "mode": "train", "time_backward": 1.054679, "time_data": 0.019337, "time_diff": 1.551967, "time_forward": 0.468925, "time_loss": 0.000246}
[03/28 14:16:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2650", "eta": "1:57:11", "loss": 0.070266, "lr": 0.034375, "mode": "train", "time_backward": 1.086821, "time_data": 0.036805, "time_diff": 1.567679, "time_forward": 0.426051, "time_loss": 0.000239}
[03/28 14:16:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2660", "eta": "1:56:51", "loss": 0.065696, "lr": 0.034391, "mode": "train", "time_backward": 1.085108, "time_data": 0.017955, "time_diff": 1.524499, "time_forward": 0.413076, "time_loss": 0.000325}
[03/28 14:16:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2670", "eta": "1:56:31", "loss": 0.073863, "lr": 0.034407, "mode": "train", "time_backward": 1.135924, "time_data": 0.018861, "time_diff": 1.562590, "time_forward": 0.399975, "time_loss": 0.000364}
[03/28 14:17:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2680", "eta": "1:56:11", "loss": 0.072726, "lr": 0.034424, "mode": "train", "time_backward": 1.196458, "time_data": 0.020205, "time_diff": 1.656984, "time_forward": 0.421639, "time_loss": 0.000341}
[03/28 14:17:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2690", "eta": "1:55:51", "loss": 0.069847, "lr": 0.034440, "mode": "train", "time_backward": 1.054273, "time_data": 0.087851, "time_diff": 1.599134, "time_forward": 0.453407, "time_loss": 0.000269}
[03/28 14:17:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2700", "eta": "1:55:32", "loss": 0.068821, "lr": 0.034456, "mode": "train", "time_backward": 1.096944, "time_data": 0.020440, "time_diff": 1.586446, "time_forward": 0.413903, "time_loss": 0.000400}
[03/28 14:17:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2710", "eta": "1:55:11", "loss": 0.072755, "lr": 0.034473, "mode": "train", "time_backward": 1.113157, "time_data": 0.017259, "time_diff": 1.545690, "time_forward": 0.399166, "time_loss": 0.000317}
[03/28 14:18:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2720", "eta": "1:54:49", "loss": 0.064742, "lr": 0.034489, "mode": "train", "time_backward": 1.127285, "time_data": 0.019524, "time_diff": 1.579992, "time_forward": 0.414904, "time_loss": 0.003501}
[03/28 14:18:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2730", "eta": "1:54:28", "loss": 0.063401, "lr": 0.034505, "mode": "train", "time_backward": 1.053002, "time_data": 0.022217, "time_diff": 1.532672, "time_forward": 0.453816, "time_loss": 0.000245}
[03/28 14:18:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2740", "eta": "1:54:08", "loss": 0.063941, "lr": 0.034522, "mode": "train", "time_backward": 1.119939, "time_data": 0.021572, "time_diff": 1.576892, "time_forward": 0.398858, "time_loss": 0.000290}
[03/28 14:18:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2750", "eta": "1:53:48", "loss": 0.068571, "lr": 0.034538, "mode": "train", "time_backward": 1.082398, "time_data": 0.018068, "time_diff": 1.502271, "time_forward": 0.398450, "time_loss": 0.000224}
[03/28 14:19:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2760", "eta": "1:53:28", "loss": 0.062395, "lr": 0.034554, "mode": "train", "time_backward": 1.081587, "time_data": 0.026217, "time_diff": 1.559365, "time_forward": 0.405665, "time_loss": 0.000381}
[03/28 14:19:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2770", "eta": "1:53:08", "loss": 0.067730, "lr": 0.034571, "mode": "train", "time_backward": 1.134255, "time_data": 0.017290, "time_diff": 1.584679, "time_forward": 0.400161, "time_loss": 0.000385}
[03/28 14:19:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2780", "eta": "1:52:49", "loss": 0.063429, "lr": 0.034587, "mode": "train", "time_backward": 1.075122, "time_data": 0.020745, "time_diff": 1.780620, "time_forward": 0.400163, "time_loss": 0.000671}
[03/28 14:19:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2790", "eta": "1:52:30", "loss": 0.061218, "lr": 0.034603, "mode": "train", "time_backward": 1.151862, "time_data": 0.022386, "time_diff": 1.578068, "time_forward": 0.402722, "time_loss": 0.000214}
[03/28 14:20:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2800", "eta": "1:52:09", "loss": 0.067077, "lr": 0.034620, "mode": "train", "time_backward": 1.116146, "time_data": 0.016962, "time_diff": 1.535251, "time_forward": 0.398622, "time_loss": 0.000252}
[03/28 14:20:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2810", "eta": "1:51:49", "loss": 0.067535, "lr": 0.034636, "mode": "train", "time_backward": 1.079147, "time_data": 0.026256, "time_diff": 1.531726, "time_forward": 0.402900, "time_loss": 0.000356}
[03/28 14:20:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2820", "eta": "1:51:29", "loss": 0.068062, "lr": 0.034652, "mode": "train", "time_backward": 1.083621, "time_data": 0.030838, "time_diff": 1.561835, "time_forward": 0.411977, "time_loss": 0.000365}
[03/28 14:21:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2830", "eta": "1:51:09", "loss": 0.068710, "lr": 0.034669, "mode": "train", "time_backward": 1.060932, "time_data": 0.023303, "time_diff": 1.504138, "time_forward": 0.408289, "time_loss": 0.000301}
[03/28 14:21:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2840", "eta": "1:50:49", "loss": 0.069728, "lr": 0.034685, "mode": "train", "time_backward": 1.079807, "time_data": 0.019320, "time_diff": 1.506529, "time_forward": 0.398551, "time_loss": 0.000230}
[03/28 14:21:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2850", "eta": "1:50:29", "loss": 0.062611, "lr": 0.034701, "mode": "train", "time_backward": 1.095801, "time_data": 0.017901, "time_diff": 1.521306, "time_forward": 0.400353, "time_loss": 0.000321}
[03/28 14:21:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2860", "eta": "1:50:08", "loss": 0.065880, "lr": 0.034718, "mode": "train", "time_backward": 1.057923, "time_data": 0.019161, "time_diff": 1.484343, "time_forward": 0.403354, "time_loss": 0.000744}
[03/28 14:22:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2870", "eta": "1:49:48", "loss": 0.069177, "lr": 0.034734, "mode": "train", "time_backward": 1.098347, "time_data": 0.018351, "time_diff": 1.561054, "time_forward": 0.432298, "time_loss": 0.000280}
[03/28 14:22:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2880", "eta": "1:49:27", "loss": 0.072014, "lr": 0.034750, "mode": "train", "time_backward": 1.096015, "time_data": 0.017177, "time_diff": 1.516610, "time_forward": 0.400196, "time_loss": 0.000329}
[03/28 14:22:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2890", "eta": "1:49:07", "loss": 0.066162, "lr": 0.034766, "mode": "train", "time_backward": 1.077775, "time_data": 0.017518, "time_diff": 1.501057, "time_forward": 0.399416, "time_loss": 0.000281}
[03/28 14:23:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2900", "eta": "1:48:46", "loss": 0.068955, "lr": 0.034783, "mode": "train", "time_backward": 1.103896, "time_data": 0.019903, "time_diff": 1.538921, "time_forward": 0.408650, "time_loss": 0.000306}
[03/28 14:23:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2910", "eta": "1:48:26", "loss": 0.062720, "lr": 0.034799, "mode": "train", "time_backward": 1.066106, "time_data": 0.016826, "time_diff": 1.484860, "time_forward": 0.398237, "time_loss": 0.000427}
[03/28 14:23:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2920", "eta": "1:48:11", "loss": 0.062930, "lr": 0.034815, "mode": "train", "time_backward": 1.739902, "time_data": 0.023044, "time_diff": 2.385779, "time_forward": 0.456818, "time_loss": 0.000370}
[03/28 14:23:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2930", "eta": "1:47:51", "loss": 0.067799, "lr": 0.034832, "mode": "train", "time_backward": 1.062110, "time_data": 0.017097, "time_diff": 1.483376, "time_forward": 0.399943, "time_loss": 0.000280}
[03/28 14:24:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2940", "eta": "1:47:31", "loss": 0.064336, "lr": 0.034848, "mode": "train", "time_backward": 1.053970, "time_data": 0.019436, "time_diff": 1.567149, "time_forward": 0.489953, "time_loss": 0.000319}
[03/28 14:24:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2950", "eta": "1:47:09", "loss": 0.068373, "lr": 0.034864, "mode": "train", "time_backward": 1.125237, "time_data": 0.020350, "time_diff": 1.580343, "time_forward": 0.416397, "time_loss": 0.005993}
[03/28 14:25:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2960", "eta": "1:46:49", "loss": 0.063530, "lr": 0.034881, "mode": "train", "time_backward": 1.088399, "time_data": 0.034618, "time_diff": 1.541402, "time_forward": 0.410727, "time_loss": 0.000334}
[03/28 14:25:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2970", "eta": "1:46:28", "loss": 0.072897, "lr": 0.034897, "mode": "train", "time_backward": 1.058731, "time_data": 0.017275, "time_diff": 1.483166, "time_forward": 0.401083, "time_loss": 0.000453}
[03/28 14:26:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2980", "eta": "1:45:45", "loss": 0.069017, "lr": 0.034913, "mode": "train", "time_backward": 1.061104, "time_data": 0.017664, "time_diff": 1.488891, "time_forward": 0.401212, "time_loss": 0.000276}
[03/28 14:26:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "2990", "eta": "1:46:28", "loss": 0.069177, "lr": 0.034930, "mode": "train", "time_backward": 1.133189, "time_data": 10.059490, "time_diff": 11.614188, "time_forward": 0.417343, "time_loss": 0.000996}
[03/28 14:27:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3000", "eta": "1:46:08", "loss": 0.066511, "lr": 0.034946, "mode": "train", "time_backward": 1.057214, "time_data": 0.019491, "time_diff": 1.482728, "time_forward": 0.400478, "time_loss": 0.000352}
[03/28 14:27:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3010", "eta": "1:45:48", "loss": 0.068345, "lr": 0.034962, "mode": "train", "time_backward": 1.097316, "time_data": 0.036467, "time_diff": 1.567823, "time_forward": 0.426745, "time_loss": 0.000503}
[03/28 14:27:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3020", "eta": "1:45:27", "loss": 0.067909, "lr": 0.034979, "mode": "train", "time_backward": 1.053812, "time_data": 0.016987, "time_diff": 1.477457, "time_forward": 0.400299, "time_loss": 0.000222}
[03/28 14:27:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3030", "eta": "1:45:07", "loss": 0.067404, "lr": 0.034995, "mode": "train", "time_backward": 1.066591, "time_data": 0.018610, "time_diff": 1.489451, "time_forward": 0.403386, "time_loss": 0.000415}
[03/28 14:28:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3040", "eta": "1:44:46", "loss": 0.069129, "lr": 0.035011, "mode": "train", "time_backward": 1.092739, "time_data": 0.018661, "time_diff": 1.522928, "time_forward": 0.404318, "time_loss": 0.000264}
[03/28 14:28:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3050", "eta": "1:44:26", "loss": 0.062057, "lr": 0.035028, "mode": "train", "time_backward": 1.055872, "time_data": 0.018548, "time_diff": 1.482693, "time_forward": 0.401025, "time_loss": 0.000472}
[03/28 14:28:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3060", "eta": "1:44:13", "loss": 0.064504, "lr": 0.035044, "mode": "train", "time_backward": 1.898465, "time_data": 0.017102, "time_diff": 2.708875, "time_forward": 0.403455, "time_loss": 0.000247}
[03/28 14:28:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3070", "eta": "1:43:52", "loss": 0.066406, "lr": 0.035060, "mode": "train", "time_backward": 1.140430, "time_data": 0.016841, "time_diff": 1.586688, "time_forward": 0.423173, "time_loss": 0.000489}
[03/28 14:29:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3080", "eta": "1:43:32", "loss": 0.068217, "lr": 0.035077, "mode": "train", "time_backward": 1.077087, "time_data": 0.113689, "time_diff": 1.602828, "time_forward": 0.402495, "time_loss": 0.000247}
[03/28 14:29:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3090", "eta": "1:43:11", "loss": 0.065912, "lr": 0.035093, "mode": "train", "time_backward": 1.090575, "time_data": 0.019653, "time_diff": 1.539275, "time_forward": 0.409489, "time_loss": 0.000257}
[03/28 14:30:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3100", "eta": "1:44:28", "loss": 0.068434, "lr": 0.035109, "mode": "train", "time_backward": 20.271252, "time_data": 0.017085, "time_diff": 20.712756, "time_forward": 0.399303, "time_loss": 0.000675}
[03/28 14:30:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3110", "eta": "1:44:08", "loss": 0.066277, "lr": 0.035126, "mode": "train", "time_backward": 1.111768, "time_data": 0.017491, "time_diff": 1.559830, "time_forward": 0.411262, "time_loss": 0.000325}
[03/28 14:30:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3120", "eta": "1:43:48", "loss": 0.073331, "lr": 0.035142, "mode": "train", "time_backward": 1.172398, "time_data": 0.017094, "time_diff": 1.605701, "time_forward": 0.399218, "time_loss": 0.000332}
[03/28 14:30:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3130", "eta": "1:43:27", "loss": 0.069349, "lr": 0.035158, "mode": "train", "time_backward": 1.055035, "time_data": 0.017166, "time_diff": 1.481164, "time_forward": 0.399595, "time_loss": 0.000293}
[03/28 14:31:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3140", "eta": "1:43:07", "loss": 0.068975, "lr": 0.035175, "mode": "train", "time_backward": 1.071701, "time_data": 0.023023, "time_diff": 1.609731, "time_forward": 0.447657, "time_loss": 0.000709}
[03/28 14:31:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3150", "eta": "1:42:46", "loss": 0.065320, "lr": 0.035191, "mode": "train", "time_backward": 1.059591, "time_data": 0.016879, "time_diff": 1.514055, "time_forward": 0.409627, "time_loss": 0.000249}
[03/28 14:31:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3160", "eta": "1:42:25", "loss": 0.064648, "lr": 0.035207, "mode": "train", "time_backward": 1.053485, "time_data": 0.017159, "time_diff": 1.564912, "time_forward": 0.448556, "time_loss": 0.000410}
[03/28 14:32:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3170", "eta": "1:42:04", "loss": 0.060611, "lr": 0.035224, "mode": "train", "time_backward": 1.056292, "time_data": 0.017198, "time_diff": 1.497578, "time_forward": 0.410758, "time_loss": 0.000920}
[03/28 14:32:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3180", "eta": "1:40:56", "loss": 0.065668, "lr": 0.035240, "mode": "train", "time_backward": 1.057739, "time_data": 0.016782, "time_diff": 1.480727, "time_forward": 0.398941, "time_loss": 0.000314}
[03/28 14:32:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3190", "eta": "1:40:35", "loss": 0.064337, "lr": 0.035256, "mode": "train", "time_backward": 1.058471, "time_data": 0.017271, "time_diff": 1.481472, "time_forward": 0.400423, "time_loss": 0.000476}
[03/28 14:33:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3200", "eta": "1:40:16", "loss": 0.063522, "lr": 0.035273, "mode": "train", "time_backward": 1.172697, "time_data": 0.022224, "time_diff": 1.618820, "time_forward": 0.409273, "time_loss": 0.000383}
[03/28 14:33:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3210", "eta": "1:39:55", "loss": 0.067377, "lr": 0.035289, "mode": "train", "time_backward": 1.091121, "time_data": 0.018151, "time_diff": 1.534522, "time_forward": 0.407915, "time_loss": 0.000270}
[03/28 14:33:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3220", "eta": "1:39:34", "loss": 0.067616, "lr": 0.035305, "mode": "train", "time_backward": 1.095476, "time_data": 0.020160, "time_diff": 1.529481, "time_forward": 0.410055, "time_loss": 0.000269}
[03/28 14:34:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3230", "eta": "1:39:17", "loss": 0.062162, "lr": 0.035322, "mode": "train", "time_backward": 1.847923, "time_data": 0.017403, "time_diff": 2.280792, "time_forward": 0.403414, "time_loss": 0.000270}
[03/28 14:34:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3240", "eta": "1:38:57", "loss": 0.065807, "lr": 0.035338, "mode": "train", "time_backward": 1.125360, "time_data": 0.017738, "time_diff": 1.549878, "time_forward": 0.399397, "time_loss": 0.000359}
[03/28 14:34:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3250", "eta": "1:38:39", "loss": 0.060969, "lr": 0.035354, "mode": "train", "time_backward": 1.053888, "time_data": 0.026389, "time_diff": 1.865314, "time_forward": 0.705794, "time_loss": 0.075951}
[03/28 14:35:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3260", "eta": "1:37:39", "loss": 0.068716, "lr": 0.035371, "mode": "train", "time_backward": 1.062473, "time_data": 0.018335, "time_diff": 1.489558, "time_forward": 0.401007, "time_loss": 0.000411}
[03/28 14:35:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3270", "eta": "1:37:21", "loss": 0.065745, "lr": 0.035387, "mode": "train", "time_backward": 1.221649, "time_data": 0.020121, "time_diff": 1.959373, "time_forward": 0.631260, "time_loss": 0.036991}
[03/28 14:35:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3280", "eta": "1:37:02", "loss": 0.066612, "lr": 0.035403, "mode": "train", "time_backward": 1.242294, "time_data": 0.017172, "time_diff": 1.719961, "time_forward": 0.412419, "time_loss": 0.005701}
[03/28 14:35:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3290", "eta": "1:36:41", "loss": 0.067497, "lr": 0.035420, "mode": "train", "time_backward": 1.106037, "time_data": 0.020246, "time_diff": 1.542516, "time_forward": 0.407403, "time_loss": 0.000250}
[03/28 14:36:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3300", "eta": "1:36:21", "loss": 0.067639, "lr": 0.035436, "mode": "train", "time_backward": 1.087786, "time_data": 0.017704, "time_diff": 1.509504, "time_forward": 0.400425, "time_loss": 0.000391}
[03/28 14:36:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3310", "eta": "1:36:00", "loss": 0.068947, "lr": 0.035452, "mode": "train", "time_backward": 1.107605, "time_data": 0.018002, "time_diff": 1.538574, "time_forward": 0.398831, "time_loss": 0.000250}
[03/28 14:36:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3320", "eta": "1:35:40", "loss": 0.066029, "lr": 0.035468, "mode": "train", "time_backward": 1.069656, "time_data": 0.017434, "time_diff": 1.501305, "time_forward": 0.406567, "time_loss": 0.000481}
[03/28 14:36:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3330", "eta": "1:35:19", "loss": 0.065165, "lr": 0.035485, "mode": "train", "time_backward": 1.091820, "time_data": 0.041158, "time_diff": 1.682912, "time_forward": 0.546550, "time_loss": 0.000282}
[03/28 14:37:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3340", "eta": "1:34:58", "loss": 0.071324, "lr": 0.035501, "mode": "train", "time_backward": 1.075078, "time_data": 0.020245, "time_diff": 1.507449, "time_forward": 0.400404, "time_loss": 0.000334}
[03/28 14:37:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3350", "eta": "1:34:37", "loss": 0.062474, "lr": 0.035517, "mode": "train", "time_backward": 1.075559, "time_data": 0.017830, "time_diff": 1.520438, "time_forward": 0.403837, "time_loss": 0.000428}
[03/28 14:37:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3360", "eta": "1:34:16", "loss": 0.066406, "lr": 0.035534, "mode": "train", "time_backward": 1.056457, "time_data": 0.022320, "time_diff": 1.482008, "time_forward": 0.398952, "time_loss": 0.000233}
[03/28 14:37:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3370", "eta": "1:33:55", "loss": 0.069460, "lr": 0.035550, "mode": "train", "time_backward": 1.113378, "time_data": 0.016758, "time_diff": 1.563301, "time_forward": 0.399580, "time_loss": 0.000365}
[03/28 14:38:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3380", "eta": "1:33:35", "loss": 0.062279, "lr": 0.035566, "mode": "train", "time_backward": 1.099768, "time_data": 0.017547, "time_diff": 1.523342, "time_forward": 0.398257, "time_loss": 0.000499}
[03/28 14:38:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3390", "eta": "1:33:14", "loss": 0.064883, "lr": 0.035583, "mode": "train", "time_backward": 1.103230, "time_data": 0.017584, "time_diff": 1.537701, "time_forward": 0.404189, "time_loss": 0.000226}
[03/28 14:38:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3400", "eta": "1:32:53", "loss": 0.069462, "lr": 0.035599, "mode": "train", "time_backward": 1.082198, "time_data": 0.020090, "time_diff": 1.597462, "time_forward": 0.489584, "time_loss": 0.000330}
[03/28 14:39:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3410", "eta": "1:32:32", "loss": 0.062229, "lr": 0.035615, "mode": "train", "time_backward": 1.066472, "time_data": 0.017867, "time_diff": 1.488741, "time_forward": 0.400818, "time_loss": 0.000295}
[03/28 14:39:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3420", "eta": "1:32:11", "loss": 0.061655, "lr": 0.035632, "mode": "train", "time_backward": 1.068129, "time_data": 0.018528, "time_diff": 1.497852, "time_forward": 0.399992, "time_loss": 0.000305}
[03/28 14:39:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3430", "eta": "1:31:52", "loss": 0.066623, "lr": 0.035648, "mode": "train", "time_backward": 1.268250, "time_data": 0.017634, "time_diff": 1.704445, "time_forward": 0.410034, "time_loss": 0.000246}
[03/28 14:39:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3440", "eta": "1:31:31", "loss": 0.071571, "lr": 0.035664, "mode": "train", "time_backward": 1.072752, "time_data": 0.016857, "time_diff": 1.494370, "time_forward": 0.398682, "time_loss": 0.000248}
[03/28 14:40:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3450", "eta": "1:31:11", "loss": 0.068711, "lr": 0.035681, "mode": "train", "time_backward": 1.120911, "time_data": 0.017468, "time_diff": 1.591610, "time_forward": 0.449983, "time_loss": 0.000347}
[03/28 14:40:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3460", "eta": "1:30:50", "loss": 0.064714, "lr": 0.035697, "mode": "train", "time_backward": 1.057810, "time_data": 0.018161, "time_diff": 1.481372, "time_forward": 0.400673, "time_loss": 0.000377}
[03/28 14:40:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3470", "eta": "1:30:29", "loss": 0.066030, "lr": 0.035713, "mode": "train", "time_backward": 1.056575, "time_data": 0.017411, "time_diff": 1.479535, "time_forward": 0.398261, "time_loss": 0.000246}
[03/28 14:40:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3480", "eta": "1:30:08", "loss": 0.064469, "lr": 0.035730, "mode": "train", "time_backward": 1.058856, "time_data": 0.017119, "time_diff": 1.519505, "time_forward": 0.400055, "time_loss": 0.000316}
[03/28 14:41:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3490", "eta": "1:29:47", "loss": 0.069608, "lr": 0.035746, "mode": "train", "time_backward": 1.080885, "time_data": 0.019272, "time_diff": 1.532599, "time_forward": 0.406424, "time_loss": 0.003067}
[03/28 14:42:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3500", "eta": "1:29:27", "loss": 0.062910, "lr": 0.035762, "mode": "train", "time_backward": 1.123376, "time_data": 0.017619, "time_diff": 1.579176, "time_forward": 0.420745, "time_loss": 0.000279}
[03/28 14:42:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3510", "eta": "1:29:06", "loss": 0.066919, "lr": 0.035779, "mode": "train", "time_backward": 1.058560, "time_data": 0.016862, "time_diff": 1.482859, "time_forward": 0.403097, "time_loss": 0.000307}
[03/28 14:42:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3520", "eta": "1:28:45", "loss": 0.068584, "lr": 0.035795, "mode": "train", "time_backward": 1.055785, "time_data": 0.017105, "time_diff": 1.505257, "time_forward": 0.398303, "time_loss": 0.019929}
[03/28 14:42:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3530", "eta": "1:28:24", "loss": 0.064340, "lr": 0.035811, "mode": "train", "time_backward": 1.057088, "time_data": 0.017279, "time_diff": 1.483097, "time_forward": 0.400704, "time_loss": 0.000290}
[03/28 14:43:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3540", "eta": "1:28:01", "loss": 0.060081, "lr": 0.035828, "mode": "train", "time_backward": 1.061740, "time_data": 0.023946, "time_diff": 1.488446, "time_forward": 0.399546, "time_loss": 0.000272}
[03/28 14:43:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3550", "eta": "1:27:41", "loss": 0.066428, "lr": 0.035844, "mode": "train", "time_backward": 1.105199, "time_data": 0.018795, "time_diff": 1.577355, "time_forward": 0.444030, "time_loss": 0.000373}
[03/28 14:43:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3560", "eta": "1:27:20", "loss": 0.069287, "lr": 0.035860, "mode": "train", "time_backward": 1.060025, "time_data": 0.031072, "time_diff": 1.501780, "time_forward": 0.404243, "time_loss": 0.000240}
[03/28 14:44:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3570", "eta": "1:26:59", "loss": 0.061280, "lr": 0.035877, "mode": "train", "time_backward": 1.104519, "time_data": 0.019996, "time_diff": 1.551113, "time_forward": 0.419159, "time_loss": 0.000372}
[03/28 14:44:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3580", "eta": "1:26:39", "loss": 0.064612, "lr": 0.035893, "mode": "train", "time_backward": 1.056046, "time_data": 0.016758, "time_diff": 1.479642, "time_forward": 0.400043, "time_loss": 0.000414}
[03/28 14:44:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3590", "eta": "1:26:18", "loss": 0.066650, "lr": 0.035909, "mode": "train", "time_backward": 1.066410, "time_data": 0.019257, "time_diff": 1.493920, "time_forward": 0.404843, "time_loss": 0.000431}
[03/28 14:44:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3600", "eta": "1:25:58", "loss": 0.066862, "lr": 0.035926, "mode": "train", "time_backward": 1.067578, "time_data": 0.025874, "time_diff": 1.509876, "time_forward": 0.400013, "time_loss": 0.000376}
[03/28 14:45:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3610", "eta": "1:25:37", "loss": 0.066395, "lr": 0.035942, "mode": "train", "time_backward": 1.054347, "time_data": 0.031653, "time_diff": 1.494930, "time_forward": 0.399863, "time_loss": 0.000245}
[03/28 14:45:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3620", "eta": "1:25:16", "loss": 0.062087, "lr": 0.035958, "mode": "train", "time_backward": 1.079121, "time_data": 0.016659, "time_diff": 1.529820, "time_forward": 0.398257, "time_loss": 0.000321}
[03/28 14:45:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3630", "eta": "1:24:57", "loss": 0.064362, "lr": 0.035975, "mode": "train", "time_backward": 1.103268, "time_data": 0.022577, "time_diff": 1.766141, "time_forward": 0.599943, "time_loss": 0.037622}
[03/28 14:45:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3640", "eta": "1:24:36", "loss": 0.063571, "lr": 0.035991, "mode": "train", "time_backward": 1.110308, "time_data": 0.019890, "time_diff": 1.598501, "time_forward": 0.455925, "time_loss": 0.000454}
[03/28 14:46:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3650", "eta": "1:24:16", "loss": 0.067869, "lr": 0.036007, "mode": "train", "time_backward": 1.056590, "time_data": 0.021329, "time_diff": 1.495365, "time_forward": 0.400595, "time_loss": 0.000230}
[03/28 14:46:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3660", "eta": "1:23:55", "loss": 0.060274, "lr": 0.036024, "mode": "train", "time_backward": 1.053207, "time_data": 0.016489, "time_diff": 1.487329, "time_forward": 0.398127, "time_loss": 0.000215}
[03/28 14:46:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3670", "eta": "1:23:34", "loss": 0.073990, "lr": 0.036040, "mode": "train", "time_backward": 1.096357, "time_data": 0.017633, "time_diff": 1.527181, "time_forward": 0.405050, "time_loss": 0.000386}
[03/28 14:47:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3680", "eta": "1:23:14", "loss": 0.065637, "lr": 0.036056, "mode": "train", "time_backward": 1.070052, "time_data": 0.018478, "time_diff": 1.597349, "time_forward": 0.411655, "time_loss": 0.000408}
[03/28 14:47:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3690", "eta": "1:22:54", "loss": 0.062698, "lr": 0.036073, "mode": "train", "time_backward": 1.112345, "time_data": 0.021682, "time_diff": 1.552035, "time_forward": 0.409254, "time_loss": 0.000346}
[03/28 14:47:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3700", "eta": "1:22:33", "loss": 0.064630, "lr": 0.036089, "mode": "train", "time_backward": 1.054165, "time_data": 0.017846, "time_diff": 1.532435, "time_forward": 0.452118, "time_loss": 0.000288}
[03/28 14:47:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3710", "eta": "1:22:13", "loss": 0.058661, "lr": 0.036105, "mode": "train", "time_backward": 1.083692, "time_data": 0.017587, "time_diff": 1.572625, "time_forward": 0.409570, "time_loss": 0.053682}
[03/28 14:48:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3720", "eta": "1:21:52", "loss": 0.063947, "lr": 0.036122, "mode": "train", "time_backward": 1.078373, "time_data": 0.033879, "time_diff": 1.522217, "time_forward": 0.407244, "time_loss": 0.000332}
[03/28 14:48:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3730", "eta": "1:21:31", "loss": 0.066503, "lr": 0.036138, "mode": "train", "time_backward": 1.062097, "time_data": 0.017965, "time_diff": 1.495370, "time_forward": 0.400280, "time_loss": 0.000248}
[03/28 14:48:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3740", "eta": "1:21:10", "loss": 0.071325, "lr": 0.036154, "mode": "train", "time_backward": 1.068801, "time_data": 0.035888, "time_diff": 1.559388, "time_forward": 0.451855, "time_loss": 0.000385}
[03/28 14:48:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3750", "eta": "1:20:49", "loss": 0.068289, "lr": 0.036171, "mode": "train", "time_backward": 1.107983, "time_data": 0.018275, "time_diff": 1.537794, "time_forward": 0.400739, "time_loss": 0.000515}
[03/28 14:49:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3760", "eta": "1:20:28", "loss": 0.059936, "lr": 0.036187, "mode": "train", "time_backward": 1.054170, "time_data": 0.017239, "time_diff": 1.522381, "time_forward": 0.447189, "time_loss": 0.000407}
[03/28 14:49:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3770", "eta": "1:20:07", "loss": 0.067991, "lr": 0.036203, "mode": "train", "time_backward": 1.064933, "time_data": 0.017571, "time_diff": 1.490561, "time_forward": 0.399271, "time_loss": 0.000346}
[03/28 14:49:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3780", "eta": "1:19:47", "loss": 0.062324, "lr": 0.036219, "mode": "train", "time_backward": 1.091990, "time_data": 0.019144, "time_diff": 1.566350, "time_forward": 0.399171, "time_loss": 0.000312}
[03/28 14:50:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3790", "eta": "1:19:26", "loss": 0.064136, "lr": 0.036236, "mode": "train", "time_backward": 1.092360, "time_data": 0.017434, "time_diff": 1.532155, "time_forward": 0.418386, "time_loss": 0.001344}
[03/28 14:50:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3800", "eta": "1:19:05", "loss": 0.069566, "lr": 0.036252, "mode": "train", "time_backward": 1.065645, "time_data": 0.019206, "time_diff": 1.499038, "time_forward": 0.399960, "time_loss": 0.000364}
[03/28 14:50:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3810", "eta": "1:18:44", "loss": 0.069045, "lr": 0.036268, "mode": "train", "time_backward": 1.085961, "time_data": 0.032966, "time_diff": 1.552792, "time_forward": 0.419640, "time_loss": 0.000257}
[03/28 14:50:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3820", "eta": "1:18:22", "loss": 0.064719, "lr": 0.036285, "mode": "train", "time_backward": 1.061367, "time_data": 0.017969, "time_diff": 1.482992, "time_forward": 0.400064, "time_loss": 0.000365}
[03/28 14:51:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3830", "eta": "1:18:01", "loss": 0.063830, "lr": 0.036301, "mode": "train", "time_backward": 1.060751, "time_data": 0.018094, "time_diff": 1.486892, "time_forward": 0.399682, "time_loss": 0.000334}
[03/28 14:51:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3840", "eta": "1:17:40", "loss": 0.069308, "lr": 0.036317, "mode": "train", "time_backward": 1.067231, "time_data": 0.017680, "time_diff": 1.488374, "time_forward": 0.400136, "time_loss": 0.000376}
[03/28 14:51:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3850", "eta": "1:17:20", "loss": 0.061173, "lr": 0.036334, "mode": "train", "time_backward": 1.076099, "time_data": 0.018498, "time_diff": 1.528374, "time_forward": 0.398891, "time_loss": 0.000244}
[03/28 14:52:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3860", "eta": "1:16:58", "loss": 0.064093, "lr": 0.036350, "mode": "train", "time_backward": 1.055352, "time_data": 0.018161, "time_diff": 1.484148, "time_forward": 0.401728, "time_loss": 0.000693}
[03/28 14:52:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3870", "eta": "1:16:38", "loss": 0.062808, "lr": 0.036366, "mode": "train", "time_backward": 1.175580, "time_data": 0.018687, "time_diff": 1.636917, "time_forward": 0.400100, "time_loss": 0.000266}
[03/28 14:53:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3880", "eta": "1:16:20", "loss": 0.067715, "lr": 0.036383, "mode": "train", "time_backward": 1.192279, "time_data": 0.017238, "time_diff": 1.965325, "time_forward": 0.398965, "time_loss": 0.000251}
[03/28 14:53:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3890", "eta": "1:15:59", "loss": 0.064037, "lr": 0.036399, "mode": "train", "time_backward": 1.057905, "time_data": 0.016965, "time_diff": 1.489993, "time_forward": 0.401423, "time_loss": 0.000342}
[03/28 14:53:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3900", "eta": "1:15:38", "loss": 0.061794, "lr": 0.036415, "mode": "train", "time_backward": 1.059157, "time_data": 0.017428, "time_diff": 1.481407, "time_forward": 0.399560, "time_loss": 0.000280}
[03/28 14:53:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3910", "eta": "1:15:18", "loss": 0.067104, "lr": 0.036432, "mode": "train", "time_backward": 1.127566, "time_data": 0.020358, "time_diff": 1.597230, "time_forward": 0.398767, "time_loss": 0.000316}
[03/28 14:54:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3920", "eta": "1:14:58", "loss": 0.066563, "lr": 0.036448, "mode": "train", "time_backward": 1.159735, "time_data": 0.026165, "time_diff": 1.608910, "time_forward": 0.420676, "time_loss": 0.002032}
[03/28 14:54:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3930", "eta": "1:14:37", "loss": 0.066926, "lr": 0.036464, "mode": "train", "time_backward": 1.096855, "time_data": 0.017514, "time_diff": 1.522412, "time_forward": 0.399704, "time_loss": 0.000272}
[03/28 14:54:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3940", "eta": "1:14:16", "loss": 0.065740, "lr": 0.036481, "mode": "train", "time_backward": 1.079464, "time_data": 0.017124, "time_diff": 1.514597, "time_forward": 0.410625, "time_loss": 0.000378}
[03/28 14:55:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3950", "eta": "1:13:56", "loss": 0.065266, "lr": 0.036497, "mode": "train", "time_backward": 1.065247, "time_data": 0.018638, "time_diff": 1.513533, "time_forward": 0.406340, "time_loss": 0.000351}
[03/28 14:55:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3960", "eta": "1:13:36", "loss": 0.061353, "lr": 0.036513, "mode": "train", "time_backward": 1.169705, "time_data": 0.022933, "time_diff": 1.647145, "time_forward": 0.416020, "time_loss": 0.000233}
[03/28 14:55:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3970", "eta": "1:13:16", "loss": 0.061763, "lr": 0.036530, "mode": "train", "time_backward": 1.075089, "time_data": 0.017956, "time_diff": 1.506863, "time_forward": 0.410841, "time_loss": 0.000407}
[03/28 14:56:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3980", "eta": "1:12:55", "loss": 0.067489, "lr": 0.036546, "mode": "train", "time_backward": 1.078958, "time_data": 0.033512, "time_diff": 1.738957, "time_forward": 0.615679, "time_loss": 0.000451}
[03/28 14:56:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "3990", "eta": "1:12:35", "loss": 0.063997, "lr": 0.036562, "mode": "train", "time_backward": 1.071421, "time_data": 0.049981, "time_diff": 1.525014, "time_forward": 0.398273, "time_loss": 0.000264}
[03/28 14:56:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4000", "eta": "1:12:15", "loss": 0.065405, "lr": 0.036579, "mode": "train", "time_backward": 1.213335, "time_data": 0.017267, "time_diff": 1.722492, "time_forward": 0.408485, "time_loss": 0.000245}
[03/28 14:57:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4010", "eta": "1:11:54", "loss": 0.065341, "lr": 0.036595, "mode": "train", "time_backward": 1.101876, "time_data": 0.016955, "time_diff": 1.522589, "time_forward": 0.399483, "time_loss": 0.000306}
[03/28 14:57:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4020", "eta": "1:11:21", "loss": 0.062106, "lr": 0.036611, "mode": "train", "time_backward": 1.108182, "time_data": 0.018543, "time_diff": 1.528795, "time_forward": 0.398138, "time_loss": 0.000225}
[03/28 14:57:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4030", "eta": "1:11:01", "loss": 0.065930, "lr": 0.036628, "mode": "train", "time_backward": 1.082561, "time_data": 0.027792, "time_diff": 1.553785, "time_forward": 0.439694, "time_loss": 0.000410}
[03/28 14:57:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4040", "eta": "1:10:40", "loss": 0.067942, "lr": 0.036644, "mode": "train", "time_backward": 1.109796, "time_data": 0.018122, "time_diff": 1.533122, "time_forward": 0.402499, "time_loss": 0.000258}
[03/28 14:58:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4050", "eta": "1:10:20", "loss": 0.060910, "lr": 0.036660, "mode": "train", "time_backward": 1.110983, "time_data": 0.017487, "time_diff": 1.576338, "time_forward": 0.399915, "time_loss": 0.000365}
[03/28 14:58:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4060", "eta": "1:10:00", "loss": 0.069652, "lr": 0.036677, "mode": "train", "time_backward": 1.169280, "time_data": 0.017365, "time_diff": 1.634767, "time_forward": 0.404411, "time_loss": 0.000339}
[03/28 14:58:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4070", "eta": "1:09:40", "loss": 0.062228, "lr": 0.036693, "mode": "train", "time_backward": 1.117387, "time_data": 0.018520, "time_diff": 1.641535, "time_forward": 0.408699, "time_loss": 0.000322}
[03/28 14:58:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4080", "eta": "1:09:20", "loss": 0.063546, "lr": 0.036709, "mode": "train", "time_backward": 1.053283, "time_data": 0.111011, "time_diff": 1.675762, "time_forward": 0.507805, "time_loss": 0.000252}
[03/28 14:59:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4090", "eta": "1:08:59", "loss": 0.067106, "lr": 0.036726, "mode": "train", "time_backward": 1.086784, "time_data": 0.017038, "time_diff": 1.526714, "time_forward": 0.404162, "time_loss": 0.002823}
[03/28 14:59:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4100", "eta": "1:08:38", "loss": 0.056436, "lr": 0.036742, "mode": "train", "time_backward": 1.080041, "time_data": 0.016757, "time_diff": 1.538424, "time_forward": 0.435316, "time_loss": 0.000319}
[03/28 14:59:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4110", "eta": "1:08:17", "loss": 0.068747, "lr": 0.036758, "mode": "train", "time_backward": 1.134384, "time_data": 0.018817, "time_diff": 1.587435, "time_forward": 0.407239, "time_loss": 0.000280}
[03/28 15:00:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4120", "eta": "1:07:57", "loss": 0.061634, "lr": 0.036775, "mode": "train", "time_backward": 1.080872, "time_data": 0.018321, "time_diff": 1.523109, "time_forward": 0.421644, "time_loss": 0.001420}
[03/28 15:00:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4130", "eta": "1:07:36", "loss": 0.064726, "lr": 0.036791, "mode": "train", "time_backward": 1.056100, "time_data": 0.017366, "time_diff": 1.484626, "time_forward": 0.400173, "time_loss": 0.000303}
[03/28 15:00:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4140", "eta": "1:07:25", "loss": 0.062020, "lr": 0.036807, "mode": "train", "time_backward": 1.227884, "time_data": 1.664072, "time_diff": 3.890562, "time_forward": 0.919591, "time_loss": 0.022035}
[03/28 15:01:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4150", "eta": "1:07:05", "loss": 0.064747, "lr": 0.036824, "mode": "train", "time_backward": 1.177465, "time_data": 0.020556, "time_diff": 1.621801, "time_forward": 0.418131, "time_loss": 0.000252}
[03/28 15:01:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4160", "eta": "1:06:44", "loss": 0.061923, "lr": 0.036840, "mode": "train", "time_backward": 1.065514, "time_data": 0.016766, "time_diff": 1.494053, "time_forward": 0.399878, "time_loss": 0.000357}
[03/28 15:01:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4170", "eta": "1:06:23", "loss": 0.062941, "lr": 0.036856, "mode": "train", "time_backward": 1.059965, "time_data": 0.016721, "time_diff": 1.481839, "time_forward": 0.398939, "time_loss": 0.000252}
[03/28 15:02:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4180", "eta": "1:07:26", "loss": 0.068915, "lr": 0.036873, "mode": "train", "time_backward": 22.435399, "time_data": 0.017233, "time_diff": 23.180154, "time_forward": 0.618132, "time_loss": 0.001161}
[03/28 15:02:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4190", "eta": "1:07:04", "loss": 0.063243, "lr": 0.036889, "mode": "train", "time_backward": 1.055459, "time_data": 0.017531, "time_diff": 1.480786, "time_forward": 0.404366, "time_loss": 0.000254}
[03/28 15:02:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4200", "eta": "1:06:43", "loss": 0.064368, "lr": 0.036905, "mode": "train", "time_backward": 1.057327, "time_data": 0.031282, "time_diff": 1.501452, "time_forward": 0.401348, "time_loss": 0.000404}
[03/28 15:03:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4210", "eta": "1:06:22", "loss": 0.067844, "lr": 0.036921, "mode": "train", "time_backward": 1.072180, "time_data": 0.017304, "time_diff": 1.499967, "time_forward": 0.400865, "time_loss": 0.000432}
[03/28 15:03:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4220", "eta": "1:06:01", "loss": 0.064642, "lr": 0.036938, "mode": "train", "time_backward": 1.090480, "time_data": 0.016778, "time_diff": 1.596315, "time_forward": 0.488152, "time_loss": 0.000607}
[03/28 15:03:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4230", "eta": "1:05:40", "loss": 0.069424, "lr": 0.036954, "mode": "train", "time_backward": 1.152786, "time_data": 0.018485, "time_diff": 1.591847, "time_forward": 0.406577, "time_loss": 0.000409}
[03/28 15:04:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4240", "eta": "1:05:19", "loss": 0.064180, "lr": 0.036970, "mode": "train", "time_backward": 1.112672, "time_data": 0.026383, "time_diff": 1.548327, "time_forward": 0.400207, "time_loss": 0.000261}
[03/28 15:04:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4250", "eta": "1:04:59", "loss": 0.067143, "lr": 0.036987, "mode": "train", "time_backward": 1.113615, "time_data": 0.026326, "time_diff": 1.557411, "time_forward": 0.399360, "time_loss": 0.000227}
[03/28 15:04:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4260", "eta": "1:04:39", "loss": 0.065760, "lr": 0.037003, "mode": "train", "time_backward": 1.126204, "time_data": 0.017402, "time_diff": 1.831678, "time_forward": 0.684317, "time_loss": 0.000390}
[03/28 15:04:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4270", "eta": "1:04:13", "loss": 0.062398, "lr": 0.037019, "mode": "train", "time_backward": 1.060049, "time_data": 0.045320, "time_diff": 1.547664, "time_forward": 0.434333, "time_loss": 0.000986}
[03/28 15:05:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4280", "eta": "1:03:52", "loss": 0.067967, "lr": 0.037036, "mode": "train", "time_backward": 1.055603, "time_data": 0.032611, "time_diff": 1.576264, "time_forward": 0.440026, "time_loss": 0.001713}
[03/28 15:05:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4290", "eta": "1:03:31", "loss": 0.070384, "lr": 0.037052, "mode": "train", "time_backward": 1.085337, "time_data": 0.021837, "time_diff": 1.529091, "time_forward": 0.414479, "time_loss": 0.000271}
[03/28 15:05:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4300", "eta": "1:03:11", "loss": 0.058967, "lr": 0.037068, "mode": "train", "time_backward": 1.122016, "time_data": 0.019642, "time_diff": 1.821133, "time_forward": 0.672424, "time_loss": 0.001051}
[03/28 15:06:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4310", "eta": "1:03:41", "loss": 0.062198, "lr": 0.037085, "mode": "train", "time_backward": 15.591465, "time_data": 0.017314, "time_diff": 16.014965, "time_forward": 0.401460, "time_loss": 0.000228}
[03/28 15:06:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4320", "eta": "1:03:20", "loss": 0.067498, "lr": 0.037101, "mode": "train", "time_backward": 1.068005, "time_data": 0.017040, "time_diff": 1.489638, "time_forward": 0.400733, "time_loss": 0.000496}
[03/28 15:06:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4330", "eta": "1:02:58", "loss": 0.062603, "lr": 0.037117, "mode": "train", "time_backward": 1.057674, "time_data": 0.017335, "time_diff": 1.509493, "time_forward": 0.399345, "time_loss": 0.000363}
[03/28 15:07:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4340", "eta": "1:02:37", "loss": 0.061448, "lr": 0.037134, "mode": "train", "time_backward": 1.057600, "time_data": 0.017904, "time_diff": 1.496279, "time_forward": 0.401505, "time_loss": 0.000377}
[03/28 15:07:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4350", "eta": "1:02:15", "loss": 0.064635, "lr": 0.037150, "mode": "train", "time_backward": 1.056731, "time_data": 0.016951, "time_diff": 1.506488, "time_forward": 0.399590, "time_loss": 0.000382}
[03/28 15:08:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4360", "eta": "1:01:54", "loss": 0.067834, "lr": 0.037166, "mode": "train", "time_backward": 1.064537, "time_data": 0.018073, "time_diff": 1.501155, "time_forward": 0.415447, "time_loss": 0.000376}
[03/28 15:08:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4370", "eta": "1:01:33", "loss": 0.064366, "lr": 0.037183, "mode": "train", "time_backward": 1.057825, "time_data": 0.017269, "time_diff": 1.531590, "time_forward": 0.447258, "time_loss": 0.000251}
[03/28 15:08:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4380", "eta": "1:01:11", "loss": 0.067302, "lr": 0.037199, "mode": "train", "time_backward": 1.078880, "time_data": 0.019088, "time_diff": 1.505873, "time_forward": 0.400403, "time_loss": 0.000406}
[03/28 15:09:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4390", "eta": "1:00:50", "loss": 0.062582, "lr": 0.037215, "mode": "train", "time_backward": 1.055090, "time_data": 0.020295, "time_diff": 1.513431, "time_forward": 0.434452, "time_loss": 0.000371}
[03/28 15:09:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4400", "eta": "1:00:28", "loss": 0.064050, "lr": 0.037232, "mode": "train", "time_backward": 1.099896, "time_data": 0.022024, "time_diff": 1.563350, "time_forward": 0.435254, "time_loss": 0.000547}
[03/28 15:09:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4410", "eta": "1:00:07", "loss": 0.061981, "lr": 0.037248, "mode": "train", "time_backward": 1.059006, "time_data": 0.017651, "time_diff": 1.516138, "time_forward": 0.400197, "time_loss": 0.000428}
[03/28 15:10:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4420", "eta": "0:59:46", "loss": 0.060816, "lr": 0.037264, "mode": "train", "time_backward": 1.063197, "time_data": 0.017314, "time_diff": 1.481042, "time_forward": 0.399857, "time_loss": 0.000347}
[03/28 15:10:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4430", "eta": "0:59:24", "loss": 0.062184, "lr": 0.037281, "mode": "train", "time_backward": 1.095979, "time_data": 0.050437, "time_diff": 1.603623, "time_forward": 0.403218, "time_loss": 0.000420}
[03/28 15:10:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4440", "eta": "0:59:03", "loss": 0.066547, "lr": 0.037297, "mode": "train", "time_backward": 1.078324, "time_data": 0.017240, "time_diff": 1.510856, "time_forward": 0.407424, "time_loss": 0.000777}
[03/28 15:11:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4450", "eta": "0:58:41", "loss": 0.072131, "lr": 0.037313, "mode": "train", "time_backward": 1.079800, "time_data": 0.018136, "time_diff": 1.527266, "time_forward": 0.424455, "time_loss": 0.000289}
[03/28 15:11:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4460", "eta": "0:58:20", "loss": 0.066983, "lr": 0.037330, "mode": "train", "time_backward": 1.055593, "time_data": 0.017265, "time_diff": 1.519834, "time_forward": 0.443531, "time_loss": 0.000245}
[03/28 15:12:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4470", "eta": "0:57:59", "loss": 0.061248, "lr": 0.037346, "mode": "train", "time_backward": 1.057512, "time_data": 0.018986, "time_diff": 1.485948, "time_forward": 0.399009, "time_loss": 0.000228}
[03/28 15:12:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4480", "eta": "0:57:38", "loss": 0.064063, "lr": 0.037362, "mode": "train", "time_backward": 1.065399, "time_data": 0.017746, "time_diff": 1.530146, "time_forward": 0.399275, "time_loss": 0.000263}
[03/28 15:13:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4490", "eta": "0:57:17", "loss": 0.069489, "lr": 0.037379, "mode": "train", "time_backward": 1.142252, "time_data": 0.017056, "time_diff": 1.568896, "time_forward": 0.398600, "time_loss": 0.000261}
[03/28 15:13:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4500", "eta": "0:56:55", "loss": 0.063308, "lr": 0.037395, "mode": "train", "time_backward": 1.052939, "time_data": 0.016736, "time_diff": 1.537256, "time_forward": 0.448610, "time_loss": 0.000262}
[03/28 15:13:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4510", "eta": "0:56:34", "loss": 0.064799, "lr": 0.037411, "mode": "train", "time_backward": 1.102954, "time_data": 0.017117, "time_diff": 1.527818, "time_forward": 0.400492, "time_loss": 0.000413}
[03/28 15:14:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4520", "eta": "0:56:14", "loss": 0.063389, "lr": 0.037428, "mode": "train", "time_backward": 1.163218, "time_data": 0.017444, "time_diff": 1.647961, "time_forward": 0.400187, "time_loss": 0.000380}
[03/28 15:14:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4530", "eta": "0:55:52", "loss": 0.063165, "lr": 0.037444, "mode": "train", "time_backward": 1.060718, "time_data": 0.018318, "time_diff": 1.502180, "time_forward": 0.416290, "time_loss": 0.000361}
[03/28 15:14:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4540", "eta": "0:55:31", "loss": 0.059127, "lr": 0.037460, "mode": "train", "time_backward": 1.152813, "time_data": 0.032462, "time_diff": 1.590907, "time_forward": 0.400242, "time_loss": 0.002152}
[03/28 15:15:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4550", "eta": "0:55:10", "loss": 0.064279, "lr": 0.037477, "mode": "train", "time_backward": 1.060657, "time_data": 0.017523, "time_diff": 1.493178, "time_forward": 0.402320, "time_loss": 0.000399}
[03/28 15:15:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4560", "eta": "0:54:49", "loss": 0.067436, "lr": 0.037493, "mode": "train", "time_backward": 1.060482, "time_data": 0.017236, "time_diff": 1.482396, "time_forward": 0.400788, "time_loss": 0.000575}
[03/28 15:15:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4570", "eta": "0:54:27", "loss": 0.070777, "lr": 0.037509, "mode": "train", "time_backward": 1.092268, "time_data": 0.024061, "time_diff": 1.543616, "time_forward": 0.401991, "time_loss": 0.000334}
[03/28 15:16:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4580", "eta": "0:54:06", "loss": 0.059768, "lr": 0.037526, "mode": "train", "time_backward": 1.074218, "time_data": 0.017010, "time_diff": 1.548518, "time_forward": 0.453747, "time_loss": 0.000318}
[03/28 15:16:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4590", "eta": "0:53:45", "loss": 0.063740, "lr": 0.037542, "mode": "train", "time_backward": 1.056127, "time_data": 0.017182, "time_diff": 1.499818, "time_forward": 0.400931, "time_loss": 0.000297}
[03/28 15:16:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4600", "eta": "0:53:24", "loss": 0.061540, "lr": 0.037558, "mode": "train", "time_backward": 1.080297, "time_data": 0.018267, "time_diff": 1.565969, "time_forward": 0.443384, "time_loss": 0.000951}
[03/28 15:16:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4610", "eta": "0:53:03", "loss": 0.063696, "lr": 0.037575, "mode": "train", "time_backward": 1.063303, "time_data": 0.019000, "time_diff": 1.539354, "time_forward": 0.398396, "time_loss": 0.000245}
[03/28 15:17:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4620", "eta": "0:52:41", "loss": 0.068893, "lr": 0.037591, "mode": "train", "time_backward": 1.125537, "time_data": 0.023059, "time_diff": 1.576469, "time_forward": 0.421634, "time_loss": 0.000309}
[03/28 15:17:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4630", "eta": "0:52:20", "loss": 0.066898, "lr": 0.037607, "mode": "train", "time_backward": 1.055892, "time_data": 0.017357, "time_diff": 1.481604, "time_forward": 0.399345, "time_loss": 0.000281}
[03/28 15:17:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4640", "eta": "0:51:59", "loss": 0.065676, "lr": 0.037623, "mode": "train", "time_backward": 1.057390, "time_data": 0.017167, "time_diff": 1.479956, "time_forward": 0.400521, "time_loss": 0.000242}
[03/28 15:18:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4650", "eta": "0:51:29", "loss": 0.066128, "lr": 0.037640, "mode": "train", "time_backward": 1.070032, "time_data": 0.016740, "time_diff": 1.494049, "time_forward": 0.398433, "time_loss": 0.000249}
[03/28 15:18:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4660", "eta": "0:51:08", "loss": 0.072294, "lr": 0.037656, "mode": "train", "time_backward": 1.096677, "time_data": 0.026781, "time_diff": 1.532280, "time_forward": 0.400185, "time_loss": 0.000320}
[03/28 15:18:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4670", "eta": "0:50:47", "loss": 0.058071, "lr": 0.037672, "mode": "train", "time_backward": 1.054896, "time_data": 0.017457, "time_diff": 1.504028, "time_forward": 0.429952, "time_loss": 0.000253}
[03/28 15:19:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4680", "eta": "0:50:28", "loss": 0.065484, "lr": 0.037689, "mode": "train", "time_backward": 1.150614, "time_data": 0.016857, "time_diff": 2.369923, "time_forward": 1.198696, "time_loss": 0.000439}
[03/28 15:19:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4690", "eta": "0:50:07", "loss": 0.064914, "lr": 0.037705, "mode": "train", "time_backward": 1.083489, "time_data": 0.018450, "time_diff": 1.608436, "time_forward": 0.493513, "time_loss": 0.001011}
[03/28 15:19:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4700", "eta": "0:49:46", "loss": 0.062025, "lr": 0.037721, "mode": "train", "time_backward": 1.116568, "time_data": 0.022065, "time_diff": 1.604652, "time_forward": 0.453847, "time_loss": 0.000354}
[03/28 15:19:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4710", "eta": "0:49:25", "loss": 0.063584, "lr": 0.037738, "mode": "train", "time_backward": 1.060002, "time_data": 0.017283, "time_diff": 1.493517, "time_forward": 0.400988, "time_loss": 0.000569}
[03/28 15:20:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4720", "eta": "0:49:04", "loss": 0.059955, "lr": 0.037754, "mode": "train", "time_backward": 1.098780, "time_data": 0.017307, "time_diff": 1.525716, "time_forward": 0.400584, "time_loss": 0.000508}
[03/28 15:20:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4730", "eta": "0:48:43", "loss": 0.063914, "lr": 0.037770, "mode": "train", "time_backward": 1.058563, "time_data": 0.016787, "time_diff": 1.486216, "time_forward": 0.404273, "time_loss": 0.000683}
[03/28 15:20:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4740", "eta": "0:48:21", "loss": 0.066489, "lr": 0.037787, "mode": "train", "time_backward": 1.095504, "time_data": 0.016788, "time_diff": 1.523112, "time_forward": 0.407230, "time_loss": 0.000322}
[03/28 15:21:08] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4750", "eta": "0:48:00", "loss": 0.058949, "lr": 0.037803, "mode": "train", "time_backward": 1.058407, "time_data": 0.033477, "time_diff": 1.589268, "time_forward": 0.491202, "time_loss": 0.000331}
[03/28 15:21:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4760", "eta": "0:47:39", "loss": 0.061835, "lr": 0.037819, "mode": "train", "time_backward": 1.055864, "time_data": 0.017842, "time_diff": 1.482612, "time_forward": 0.400290, "time_loss": 0.000337}
[03/28 15:21:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4770", "eta": "0:46:29", "loss": 0.058289, "lr": 0.037836, "mode": "train", "time_backward": 1.077092, "time_data": 0.018222, "time_diff": 1.500239, "time_forward": 0.399825, "time_loss": 0.000706}
[03/28 15:22:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4780", "eta": "0:46:08", "loss": 0.062742, "lr": 0.037852, "mode": "train", "time_backward": 1.058362, "time_data": 0.016987, "time_diff": 1.510941, "time_forward": 0.399482, "time_loss": 0.000276}
[03/28 15:23:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4790", "eta": "0:45:54", "loss": 0.063283, "lr": 0.037868, "mode": "train", "time_backward": 3.819210, "time_data": 0.020111, "time_diff": 4.254378, "time_forward": 0.402597, "time_loss": 0.000600}
[03/28 15:23:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4800", "eta": "0:45:33", "loss": 0.063654, "lr": 0.037885, "mode": "train", "time_backward": 1.057068, "time_data": 0.017485, "time_diff": 1.479548, "time_forward": 0.401470, "time_loss": 0.000259}
[03/28 15:23:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4810", "eta": "0:45:13", "loss": 0.068587, "lr": 0.037901, "mode": "train", "time_backward": 1.081028, "time_data": 0.017269, "time_diff": 1.724656, "time_forward": 0.621092, "time_loss": 0.000677}
[03/28 15:24:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4820", "eta": "0:44:52", "loss": 0.060679, "lr": 0.037917, "mode": "train", "time_backward": 1.181687, "time_data": 0.016782, "time_diff": 1.617590, "time_forward": 0.415680, "time_loss": 0.000356}
[03/28 15:24:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4830", "eta": "0:44:32", "loss": 0.070951, "lr": 0.037934, "mode": "train", "time_backward": 1.091603, "time_data": 0.017144, "time_diff": 1.513295, "time_forward": 0.400871, "time_loss": 0.000321}
[03/28 15:24:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4840", "eta": "0:44:11", "loss": 0.061271, "lr": 0.037950, "mode": "train", "time_backward": 1.130951, "time_data": 0.030563, "time_diff": 1.609820, "time_forward": 0.405576, "time_loss": 0.000397}
[03/28 15:24:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4850", "eta": "0:43:50", "loss": 0.063231, "lr": 0.037966, "mode": "train", "time_backward": 1.106536, "time_data": 0.017391, "time_diff": 1.529486, "time_forward": 0.400474, "time_loss": 0.000344}
[03/28 15:25:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4860", "eta": "0:43:29", "loss": 0.062776, "lr": 0.037983, "mode": "train", "time_backward": 1.093170, "time_data": 0.019164, "time_diff": 1.584782, "time_forward": 0.460182, "time_loss": 0.000262}
[03/28 15:25:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4870", "eta": "0:43:09", "loss": 0.069117, "lr": 0.037999, "mode": "train", "time_backward": 1.134694, "time_data": 0.017649, "time_diff": 1.556244, "time_forward": 0.400312, "time_loss": 0.000266}
[03/28 15:25:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4880", "eta": "0:42:48", "loss": 0.061002, "lr": 0.038015, "mode": "train", "time_backward": 1.088389, "time_data": 0.017987, "time_diff": 1.523669, "time_forward": 0.408691, "time_loss": 0.005240}
[03/28 15:26:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4890", "eta": "0:42:27", "loss": 0.070426, "lr": 0.038032, "mode": "train", "time_backward": 1.085248, "time_data": 0.017321, "time_diff": 1.512219, "time_forward": 0.409146, "time_loss": 0.000240}
[03/28 15:26:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4900", "eta": "0:42:06", "loss": 0.063224, "lr": 0.038048, "mode": "train", "time_backward": 1.078878, "time_data": 0.019302, "time_diff": 1.515752, "time_forward": 0.414668, "time_loss": 0.000394}
[03/28 15:26:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4910", "eta": "0:41:46", "loss": 0.067347, "lr": 0.038064, "mode": "train", "time_backward": 1.060933, "time_data": 0.030362, "time_diff": 1.620664, "time_forward": 0.512179, "time_loss": 0.000424}
[03/28 15:26:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4920", "eta": "0:41:25", "loss": 0.059352, "lr": 0.038081, "mode": "train", "time_backward": 1.137816, "time_data": 0.022249, "time_diff": 1.569528, "time_forward": 0.403064, "time_loss": 0.000382}
[03/28 15:27:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4930", "eta": "0:41:03", "loss": 0.060862, "lr": 0.038097, "mode": "train", "time_backward": 1.061579, "time_data": 0.018226, "time_diff": 1.485924, "time_forward": 0.402881, "time_loss": 0.000751}
[03/28 15:27:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4940", "eta": "0:40:42", "loss": 0.061046, "lr": 0.038113, "mode": "train", "time_backward": 1.095152, "time_data": 0.023255, "time_diff": 1.668543, "time_forward": 0.543500, "time_loss": 0.000478}
[03/28 15:27:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4950", "eta": "0:40:22", "loss": 0.062468, "lr": 0.038130, "mode": "train", "time_backward": 1.074153, "time_data": 0.017273, "time_diff": 1.495495, "time_forward": 0.398855, "time_loss": 0.000340}
[03/28 15:28:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4960", "eta": "0:40:01", "loss": 0.066054, "lr": 0.038146, "mode": "train", "time_backward": 1.053307, "time_data": 0.019607, "time_diff": 1.479821, "time_forward": 0.400358, "time_loss": 0.000251}
[03/28 15:28:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4970", "eta": "0:39:40", "loss": 0.061597, "lr": 0.038162, "mode": "train", "time_backward": 1.054138, "time_data": 0.017863, "time_diff": 1.478704, "time_forward": 0.399065, "time_loss": 0.000335}
[03/28 15:28:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4980", "eta": "0:39:19", "loss": 0.060795, "lr": 0.038179, "mode": "train", "time_backward": 1.055932, "time_data": 0.017581, "time_diff": 1.482216, "time_forward": 0.398999, "time_loss": 0.000252}
[03/28 15:29:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "4990", "eta": "0:38:58", "loss": 0.060917, "lr": 0.038195, "mode": "train", "time_backward": 1.104198, "time_data": 0.017267, "time_diff": 1.523519, "time_forward": 0.400518, "time_loss": 0.000256}
[03/28 15:29:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5000", "eta": "0:38:37", "loss": 0.062783, "lr": 0.038211, "mode": "train", "time_backward": 1.059788, "time_data": 0.017474, "time_diff": 1.489898, "time_forward": 0.401339, "time_loss": 0.000366}
[03/28 15:30:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5010", "eta": "0:38:16", "loss": 0.061560, "lr": 0.038228, "mode": "train", "time_backward": 1.072495, "time_data": 0.024463, "time_diff": 1.506597, "time_forward": 0.402750, "time_loss": 0.000398}
[03/28 15:30:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5020", "eta": "0:37:55", "loss": 0.059794, "lr": 0.038244, "mode": "train", "time_backward": 1.109244, "time_data": 0.017681, "time_diff": 1.534877, "time_forward": 0.399188, "time_loss": 0.000269}
[03/28 15:30:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5030", "eta": "0:37:35", "loss": 0.062299, "lr": 0.038260, "mode": "train", "time_backward": 1.059775, "time_data": 0.022584, "time_diff": 1.613363, "time_forward": 0.520474, "time_loss": 0.000456}
[03/28 15:31:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5040", "eta": "0:37:14", "loss": 0.062673, "lr": 0.038277, "mode": "train", "time_backward": 1.055582, "time_data": 0.017024, "time_diff": 1.482553, "time_forward": 0.400238, "time_loss": 0.000709}
[03/28 15:31:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5050", "eta": "0:36:53", "loss": 0.060709, "lr": 0.038293, "mode": "train", "time_backward": 1.181227, "time_data": 0.017113, "time_diff": 1.634367, "time_forward": 0.400854, "time_loss": 0.000258}
[03/28 15:31:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5060", "eta": "0:36:32", "loss": 0.062049, "lr": 0.038309, "mode": "train", "time_backward": 1.115333, "time_data": 0.018523, "time_diff": 1.564892, "time_forward": 0.401555, "time_loss": 0.000279}
[03/28 15:32:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5070", "eta": "0:36:11", "loss": 0.065345, "lr": 0.038325, "mode": "train", "time_backward": 1.054636, "time_data": 0.017991, "time_diff": 1.513539, "time_forward": 0.404249, "time_loss": 0.000352}
[03/28 15:32:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5080", "eta": "0:35:50", "loss": 0.063295, "lr": 0.038342, "mode": "train", "time_backward": 1.072611, "time_data": 0.017242, "time_diff": 1.499282, "time_forward": 0.400942, "time_loss": 0.000274}
[03/28 15:32:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5090", "eta": "0:35:30", "loss": 0.066834, "lr": 0.038358, "mode": "train", "time_backward": 1.197389, "time_data": 0.017559, "time_diff": 1.852202, "time_forward": 0.634946, "time_loss": 0.000368}
[03/28 15:32:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5100", "eta": "0:35:05", "loss": 0.063858, "lr": 0.038374, "mode": "train", "time_backward": 1.066956, "time_data": 0.016940, "time_diff": 1.625572, "time_forward": 0.450964, "time_loss": 0.000284}
[03/28 15:33:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5110", "eta": "0:34:44", "loss": 0.066621, "lr": 0.038391, "mode": "train", "time_backward": 1.115529, "time_data": 0.017068, "time_diff": 1.584683, "time_forward": 0.400711, "time_loss": 0.000368}
[03/28 15:33:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5120", "eta": "0:34:23", "loss": 0.063408, "lr": 0.038407, "mode": "train", "time_backward": 1.137171, "time_data": 0.017346, "time_diff": 1.584311, "time_forward": 0.399451, "time_loss": 0.000243}
[03/28 15:33:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5130", "eta": "0:34:02", "loss": 0.067441, "lr": 0.038423, "mode": "train", "time_backward": 1.074932, "time_data": 0.016829, "time_diff": 1.507853, "time_forward": 0.398823, "time_loss": 0.000358}
[03/28 15:34:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5140", "eta": "0:33:41", "loss": 0.065643, "lr": 0.038440, "mode": "train", "time_backward": 1.061913, "time_data": 0.017326, "time_diff": 1.483544, "time_forward": 0.400304, "time_loss": 0.000629}
[03/28 15:34:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5150", "eta": "0:33:21", "loss": 0.063169, "lr": 0.038456, "mode": "train", "time_backward": 1.060408, "time_data": 0.065587, "time_diff": 1.540109, "time_forward": 0.404845, "time_loss": 0.000346}
[03/28 15:34:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5160", "eta": "0:33:00", "loss": 0.063335, "lr": 0.038472, "mode": "train", "time_backward": 1.060812, "time_data": 0.027127, "time_diff": 1.532695, "time_forward": 0.441788, "time_loss": 0.000531}
[03/28 15:34:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5170", "eta": "0:32:39", "loss": 0.064253, "lr": 0.038489, "mode": "train", "time_backward": 1.066701, "time_data": 0.017052, "time_diff": 1.492699, "time_forward": 0.405393, "time_loss": 0.000302}
[03/28 15:35:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5180", "eta": "0:32:18", "loss": 0.063075, "lr": 0.038505, "mode": "train", "time_backward": 1.067026, "time_data": 0.021370, "time_diff": 1.506480, "time_forward": 0.413824, "time_loss": 0.000252}
[03/28 15:35:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5190", "eta": "0:31:57", "loss": 0.063384, "lr": 0.038521, "mode": "train", "time_backward": 1.073050, "time_data": 0.017546, "time_diff": 1.511765, "time_forward": 0.398712, "time_loss": 0.000264}
[03/28 15:35:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5200", "eta": "0:31:37", "loss": 0.060632, "lr": 0.038538, "mode": "train", "time_backward": 1.470831, "time_data": 0.017125, "time_diff": 1.966888, "time_forward": 0.438941, "time_loss": 0.005971}
[03/28 15:35:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5210", "eta": "0:31:16", "loss": 0.061397, "lr": 0.038554, "mode": "train", "time_backward": 1.110247, "time_data": 0.025199, "time_diff": 1.587891, "time_forward": 0.440780, "time_loss": 0.000303}
[03/28 15:36:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5220", "eta": "0:30:56", "loss": 0.061469, "lr": 0.038570, "mode": "train", "time_backward": 1.180610, "time_data": 0.017547, "time_diff": 1.622497, "time_forward": 0.405193, "time_loss": 0.000408}
[03/28 15:36:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5230", "eta": "0:30:35", "loss": 0.060441, "lr": 0.038587, "mode": "train", "time_backward": 1.084455, "time_data": 0.018959, "time_diff": 1.515156, "time_forward": 0.399087, "time_loss": 0.000330}
[03/28 15:36:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5240", "eta": "0:30:14", "loss": 0.057517, "lr": 0.038603, "mode": "train", "time_backward": 1.053164, "time_data": 0.016729, "time_diff": 1.491617, "time_forward": 0.401869, "time_loss": 0.000299}
[03/28 15:37:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5250", "eta": "0:29:53", "loss": 0.064063, "lr": 0.038619, "mode": "train", "time_backward": 1.053950, "time_data": 0.017935, "time_diff": 1.483719, "time_forward": 0.400350, "time_loss": 0.000285}
[03/28 15:37:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5260", "eta": "0:29:02", "loss": 0.062541, "lr": 0.038636, "mode": "train", "time_backward": 1.063631, "time_data": 0.017636, "time_diff": 1.521517, "time_forward": 0.399449, "time_loss": 0.000402}
[03/28 15:37:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5270", "eta": "0:28:42", "loss": 0.062014, "lr": 0.038652, "mode": "train", "time_backward": 1.134016, "time_data": 0.017248, "time_diff": 1.607726, "time_forward": 0.406210, "time_loss": 0.000288}
[03/28 15:37:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5280", "eta": "0:28:21", "loss": 0.060062, "lr": 0.038668, "mode": "train", "time_backward": 1.124249, "time_data": 0.016938, "time_diff": 1.564735, "time_forward": 0.405059, "time_loss": 0.000266}
[03/28 15:38:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5290", "eta": "0:28:01", "loss": 0.062166, "lr": 0.038685, "mode": "train", "time_backward": 1.054678, "time_data": 0.160712, "time_diff": 1.665730, "time_forward": 0.409283, "time_loss": 0.000272}
[03/28 15:38:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5300", "eta": "0:27:40", "loss": 0.062252, "lr": 0.038701, "mode": "train", "time_backward": 1.065622, "time_data": 0.023743, "time_diff": 1.512505, "time_forward": 0.419430, "time_loss": 0.000354}
[03/28 15:38:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5310", "eta": "0:27:20", "loss": 0.059347, "lr": 0.038717, "mode": "train", "time_backward": 1.087530, "time_data": 0.027286, "time_diff": 1.521219, "time_forward": 0.399216, "time_loss": 0.000389}
[03/28 15:39:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5320", "eta": "0:27:00", "loss": 0.062828, "lr": 0.038734, "mode": "train", "time_backward": 1.221224, "time_data": 0.018544, "time_diff": 2.149794, "time_forward": 0.905216, "time_loss": 0.000821}
[03/28 15:39:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5330", "eta": "0:26:40", "loss": 0.064311, "lr": 0.038750, "mode": "train", "time_backward": 1.055940, "time_data": 0.016996, "time_diff": 1.478952, "time_forward": 0.399130, "time_loss": 0.000289}
[03/28 15:39:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5340", "eta": "0:26:19", "loss": 0.063600, "lr": 0.038766, "mode": "train", "time_backward": 4.654244, "time_data": 0.017200, "time_diff": 5.088820, "time_forward": 0.399728, "time_loss": 0.000310}
[03/28 15:40:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5350", "eta": "0:25:56", "loss": 0.061155, "lr": 0.038783, "mode": "train", "time_backward": 1.069971, "time_data": 0.017892, "time_diff": 1.511640, "time_forward": 0.410072, "time_loss": 0.000391}
[03/28 15:40:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5360", "eta": "0:25:36", "loss": 0.068147, "lr": 0.038799, "mode": "train", "time_backward": 1.068613, "time_data": 0.017411, "time_diff": 1.491626, "time_forward": 0.399388, "time_loss": 0.000323}
[03/28 15:40:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5370", "eta": "0:25:15", "loss": 0.066094, "lr": 0.038815, "mode": "train", "time_backward": 1.059465, "time_data": 0.017913, "time_diff": 1.489040, "time_forward": 0.400397, "time_loss": 0.000496}
[03/28 15:41:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5380", "eta": "0:24:53", "loss": 0.064979, "lr": 0.038832, "mode": "train", "time_backward": 1.082945, "time_data": 0.018522, "time_diff": 1.568269, "time_forward": 0.453773, "time_loss": 0.000385}
[03/28 15:41:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5390", "eta": "0:24:32", "loss": 0.064624, "lr": 0.038848, "mode": "train", "time_backward": 1.060447, "time_data": 0.019934, "time_diff": 1.486944, "time_forward": 0.399365, "time_loss": 0.000225}
[03/28 15:41:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5400", "eta": "0:24:12", "loss": 0.063597, "lr": 0.038864, "mode": "train", "time_backward": 1.190143, "time_data": 0.018407, "time_diff": 1.641684, "time_forward": 0.401107, "time_loss": 0.000509}
[03/28 15:41:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5410", "eta": "0:23:51", "loss": 0.061790, "lr": 0.038881, "mode": "train", "time_backward": 1.060486, "time_data": 0.018282, "time_diff": 1.492901, "time_forward": 0.398603, "time_loss": 0.000329}
[03/28 15:42:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5420", "eta": "0:23:31", "loss": 0.060687, "lr": 0.038897, "mode": "train", "time_backward": 1.052438, "time_data": 0.016823, "time_diff": 1.474815, "time_forward": 0.397974, "time_loss": 0.000252}
[03/28 15:42:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5430", "eta": "0:23:10", "loss": 0.062614, "lr": 0.038913, "mode": "train", "time_backward": 1.070556, "time_data": 0.029928, "time_diff": 1.588781, "time_forward": 0.404673, "time_loss": 0.000354}
[03/28 15:42:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5440", "eta": "0:22:50", "loss": 0.061649, "lr": 0.038930, "mode": "train", "time_backward": 1.084313, "time_data": 0.018636, "time_diff": 1.517502, "time_forward": 0.408169, "time_loss": 0.000289}
[03/28 15:43:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5450", "eta": "0:22:30", "loss": 0.061554, "lr": 0.038946, "mode": "train", "time_backward": 1.054768, "time_data": 0.022181, "time_diff": 1.619987, "time_forward": 0.533574, "time_loss": 0.000504}
[03/28 15:43:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5460", "eta": "0:22:09", "loss": 0.060001, "lr": 0.038962, "mode": "train", "time_backward": 1.099228, "time_data": 0.026793, "time_diff": 1.563730, "time_forward": 0.406105, "time_loss": 0.000265}
[03/28 15:43:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5470", "eta": "0:21:49", "loss": 0.064895, "lr": 0.038979, "mode": "train", "time_backward": 1.073302, "time_data": 0.017580, "time_diff": 1.566629, "time_forward": 0.442089, "time_loss": 0.002032}
[03/28 15:43:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5480", "eta": "0:21:29", "loss": 0.061909, "lr": 0.038995, "mode": "train", "time_backward": 1.064675, "time_data": 0.018460, "time_diff": 1.682929, "time_forward": 0.596945, "time_loss": 0.000301}
[03/28 15:44:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5490", "eta": "0:21:08", "loss": 0.064611, "lr": 0.039011, "mode": "train", "time_backward": 1.063927, "time_data": 0.020162, "time_diff": 1.551794, "time_forward": 0.398331, "time_loss": 0.000241}
[03/28 15:44:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5500", "eta": "0:20:46", "loss": 0.059339, "lr": 0.039028, "mode": "train", "time_backward": 1.088757, "time_data": 0.019122, "time_diff": 1.526828, "time_forward": 0.418178, "time_loss": 0.000239}
[03/28 15:44:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5510", "eta": "0:20:25", "loss": 0.062307, "lr": 0.039044, "mode": "train", "time_backward": 1.061677, "time_data": 0.019794, "time_diff": 1.494221, "time_forward": 0.398928, "time_loss": 0.000231}
[03/28 15:45:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5520", "eta": "0:20:05", "loss": 0.061816, "lr": 0.039060, "mode": "train", "time_backward": 1.063537, "time_data": 0.016646, "time_diff": 1.592297, "time_forward": 0.444306, "time_loss": 0.000262}
[03/28 15:45:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5530", "eta": "0:19:45", "loss": 0.067233, "lr": 0.039076, "mode": "train", "time_backward": 1.066585, "time_data": 0.018476, "time_diff": 1.521758, "time_forward": 0.403778, "time_loss": 0.000258}
[03/28 15:45:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5540", "eta": "0:19:24", "loss": 0.066517, "lr": 0.039093, "mode": "train", "time_backward": 1.108191, "time_data": 0.017047, "time_diff": 1.588886, "time_forward": 0.448103, "time_loss": 0.000342}
[03/28 15:45:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5550", "eta": "0:19:04", "loss": 0.065172, "lr": 0.039109, "mode": "train", "time_backward": 1.061107, "time_data": 0.019305, "time_diff": 1.530331, "time_forward": 0.447484, "time_loss": 0.000396}
[03/28 15:46:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5560", "eta": "0:18:44", "loss": 0.062881, "lr": 0.039125, "mode": "train", "time_backward": 1.098029, "time_data": 0.035137, "time_diff": 1.612853, "time_forward": 0.411533, "time_loss": 0.000826}
[03/28 15:46:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5570", "eta": "0:18:23", "loss": 0.066325, "lr": 0.039142, "mode": "train", "time_backward": 1.073218, "time_data": 0.021533, "time_diff": 1.522355, "time_forward": 0.402124, "time_loss": 0.022244}
[03/28 15:47:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5580", "eta": "0:18:15", "loss": 0.062291, "lr": 0.039158, "mode": "train", "time_backward": 12.553384, "time_data": 0.017156, "time_diff": 12.998872, "time_forward": 0.405140, "time_loss": 0.000266}
[03/28 15:47:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5590", "eta": "0:17:54", "loss": 0.061174, "lr": 0.039174, "mode": "train", "time_backward": 1.067850, "time_data": 0.017956, "time_diff": 1.506187, "time_forward": 0.403652, "time_loss": 0.000392}
[03/28 15:48:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5600", "eta": "0:17:34", "loss": 0.064670, "lr": 0.039191, "mode": "train", "time_backward": 1.062241, "time_data": 0.020128, "time_diff": 1.564102, "time_forward": 0.473076, "time_loss": 0.000787}
[03/28 15:48:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5610", "eta": "0:17:13", "loss": 0.054299, "lr": 0.039207, "mode": "train", "time_backward": 1.055749, "time_data": 0.017013, "time_diff": 1.535205, "time_forward": 0.458850, "time_loss": 0.000411}
[03/28 15:48:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5620", "eta": "0:16:53", "loss": 0.066558, "lr": 0.039223, "mode": "train", "time_backward": 1.132839, "time_data": 0.019124, "time_diff": 1.658493, "time_forward": 0.496502, "time_loss": 0.000255}
[03/28 15:48:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5630", "eta": "0:16:32", "loss": 0.060272, "lr": 0.039240, "mode": "train", "time_backward": 1.076818, "time_data": 0.018932, "time_diff": 1.578607, "time_forward": 0.402376, "time_loss": 0.000656}
[03/28 15:49:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5640", "eta": "0:16:11", "loss": 0.062424, "lr": 0.039256, "mode": "train", "time_backward": 1.061470, "time_data": 0.031032, "time_diff": 1.562674, "time_forward": 0.428895, "time_loss": 0.000261}
[03/28 15:49:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5650", "eta": "0:15:51", "loss": 0.065282, "lr": 0.039272, "mode": "train", "time_backward": 1.352896, "time_data": 0.017130, "time_diff": 1.871043, "time_forward": 0.423846, "time_loss": 0.000289}
[03/28 15:49:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5660", "eta": "0:15:21", "loss": 0.063032, "lr": 0.039289, "mode": "train", "time_backward": 1.059840, "time_data": 0.017088, "time_diff": 1.485758, "time_forward": 0.401803, "time_loss": 0.000401}
[03/28 15:50:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5670", "eta": "0:15:01", "loss": 0.064983, "lr": 0.039305, "mode": "train", "time_backward": 1.060937, "time_data": 0.017090, "time_diff": 1.514043, "time_forward": 0.398998, "time_loss": 0.000236}
[03/28 15:50:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5680", "eta": "0:14:41", "loss": 0.061850, "lr": 0.039321, "mode": "train", "time_backward": 1.204417, "time_data": 0.086378, "time_diff": 2.066128, "time_forward": 0.759835, "time_loss": 0.015129}
[03/28 15:50:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5690", "eta": "0:14:25", "loss": 0.060422, "lr": 0.039338, "mode": "train", "time_backward": 6.976747, "time_data": 0.017226, "time_diff": 7.417893, "time_forward": 0.420829, "time_loss": 0.000270}
[03/28 15:51:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5700", "eta": "0:14:04", "loss": 0.063472, "lr": 0.039354, "mode": "train", "time_backward": 1.060176, "time_data": 0.023921, "time_diff": 1.497503, "time_forward": 0.402821, "time_loss": 0.000286}
[03/28 15:51:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5710", "eta": "0:13:35", "loss": 0.062082, "lr": 0.039370, "mode": "train", "time_backward": 1.107166, "time_data": 3.912865, "time_diff": 5.623254, "time_forward": 0.582153, "time_loss": 0.000541}
[03/28 15:51:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5720", "eta": "0:13:14", "loss": 0.064921, "lr": 0.039387, "mode": "train", "time_backward": 1.059973, "time_data": 0.016887, "time_diff": 1.483678, "time_forward": 0.398570, "time_loss": 0.000240}
[03/28 15:52:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5730", "eta": "0:12:54", "loss": 0.060678, "lr": 0.039403, "mode": "train", "time_backward": 1.064772, "time_data": 0.018440, "time_diff": 1.491110, "time_forward": 0.399772, "time_loss": 0.000242}
[03/28 15:52:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5740", "eta": "0:12:34", "loss": 0.060247, "lr": 0.039419, "mode": "train", "time_backward": 1.102742, "time_data": 0.023135, "time_diff": 1.564455, "time_forward": 0.429421, "time_loss": 0.004108}
[03/28 15:53:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5750", "eta": "0:12:13", "loss": 0.060849, "lr": 0.039436, "mode": "train", "time_backward": 1.064039, "time_data": 0.034359, "time_diff": 1.499565, "time_forward": 0.399194, "time_loss": 0.000257}
[03/28 15:53:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5760", "eta": "0:11:53", "loss": 0.061947, "lr": 0.039452, "mode": "train", "time_backward": 1.070948, "time_data": 0.020148, "time_diff": 1.500592, "time_forward": 0.401904, "time_loss": 0.000330}
[03/28 15:53:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5770", "eta": "0:11:33", "loss": 0.066226, "lr": 0.039468, "mode": "train", "time_backward": 1.055231, "time_data": 0.018785, "time_diff": 1.483083, "time_forward": 0.405189, "time_loss": 0.000299}
[03/28 15:53:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5780", "eta": "0:11:12", "loss": 0.059463, "lr": 0.039485, "mode": "train", "time_backward": 1.062591, "time_data": 0.017327, "time_diff": 1.488266, "time_forward": 0.398549, "time_loss": 0.000299}
[03/28 15:54:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5790", "eta": "0:10:52", "loss": 0.062480, "lr": 0.039501, "mode": "train", "time_backward": 1.056675, "time_data": 0.018961, "time_diff": 1.523954, "time_forward": 0.411504, "time_loss": 0.001030}
[03/28 15:54:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5800", "eta": "0:10:33", "loss": 0.060043, "lr": 0.039517, "mode": "train", "time_backward": 3.810662, "time_data": 0.017294, "time_diff": 4.255460, "time_forward": 0.400560, "time_loss": 0.000640}
[03/28 15:54:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5810", "eta": "0:10:13", "loss": 0.061437, "lr": 0.039534, "mode": "train", "time_backward": 1.056828, "time_data": 0.018249, "time_diff": 1.513251, "time_forward": 0.399769, "time_loss": 0.000420}
[03/28 15:55:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5820", "eta": "0:09:52", "loss": 0.061961, "lr": 0.039550, "mode": "train", "time_backward": 1.061233, "time_data": 0.016782, "time_diff": 1.516728, "time_forward": 0.409568, "time_loss": 0.000340}
[03/28 15:55:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5830", "eta": "0:09:32", "loss": 0.066841, "lr": 0.039566, "mode": "train", "time_backward": 1.056725, "time_data": 0.016917, "time_diff": 1.477645, "time_forward": 0.399268, "time_loss": 0.000298}
[03/28 15:55:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5840", "eta": "0:09:11", "loss": 0.061130, "lr": 0.039583, "mode": "train", "time_backward": 1.057526, "time_data": 0.017292, "time_diff": 1.537628, "time_forward": 0.443810, "time_loss": 0.000339}
[03/28 15:56:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5850", "eta": "0:08:51", "loss": 0.064470, "lr": 0.039599, "mode": "train", "time_backward": 1.119760, "time_data": 0.034838, "time_diff": 1.556837, "time_forward": 0.398619, "time_loss": 0.000307}
[03/28 15:56:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5860", "eta": "0:08:31", "loss": 0.063072, "lr": 0.039615, "mode": "train", "time_backward": 1.106665, "time_data": 0.017269, "time_diff": 1.527029, "time_forward": 0.400310, "time_loss": 0.000398}
[03/28 15:56:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5870", "eta": "0:08:10", "loss": 0.059905, "lr": 0.039632, "mode": "train", "time_backward": 1.093224, "time_data": 0.017519, "time_diff": 1.713256, "time_forward": 0.594050, "time_loss": 0.000418}
[03/28 15:57:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5880", "eta": "0:07:50", "loss": 0.064438, "lr": 0.039648, "mode": "train", "time_backward": 1.085679, "time_data": 0.017625, "time_diff": 1.510872, "time_forward": 0.403958, "time_loss": 0.000322}
[03/28 15:57:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5890", "eta": "0:07:30", "loss": 0.058613, "lr": 0.039664, "mode": "train", "time_backward": 1.067055, "time_data": 0.018293, "time_diff": 1.492375, "time_forward": 0.401997, "time_loss": 0.001718}
[03/28 15:57:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5900", "eta": "0:07:09", "loss": 0.059991, "lr": 0.039681, "mode": "train", "time_backward": 1.061038, "time_data": 0.027642, "time_diff": 1.501189, "time_forward": 0.408030, "time_loss": 0.000357}
[03/28 15:58:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5910", "eta": "0:06:50", "loss": 0.064542, "lr": 0.039697, "mode": "train", "time_backward": 1.997393, "time_data": 0.017363, "time_diff": 3.887729, "time_forward": 1.782323, "time_loss": 0.076127}
[03/28 15:58:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5920", "eta": "0:06:29", "loss": 0.062310, "lr": 0.039713, "mode": "train", "time_backward": 1.133559, "time_data": 0.017317, "time_diff": 1.574466, "time_forward": 0.403751, "time_loss": 0.000249}
[03/28 15:58:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5930", "eta": "0:06:09", "loss": 0.061912, "lr": 0.039730, "mode": "train", "time_backward": 1.084430, "time_data": 0.017807, "time_diff": 1.521663, "time_forward": 0.400801, "time_loss": 0.000353}
[03/28 15:59:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5940", "eta": "0:05:48", "loss": 0.063502, "lr": 0.039746, "mode": "train", "time_backward": 1.070776, "time_data": 0.018765, "time_diff": 1.491211, "time_forward": 0.398142, "time_loss": 0.000222}
[03/28 15:59:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5950", "eta": "0:05:27", "loss": 0.061427, "lr": 0.039762, "mode": "train", "time_backward": 1.057562, "time_data": 0.017459, "time_diff": 1.552214, "time_forward": 0.418641, "time_loss": 0.000235}
[03/28 15:59:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5960", "eta": "0:05:07", "loss": 0.057772, "lr": 0.039778, "mode": "train", "time_backward": 1.071597, "time_data": 0.019093, "time_diff": 1.510480, "time_forward": 0.413948, "time_loss": 0.000470}
[03/28 16:00:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5970", "eta": "0:04:45", "loss": 0.064348, "lr": 0.039795, "mode": "train", "time_backward": 1.096689, "time_data": 0.018013, "time_diff": 1.586829, "time_forward": 0.412007, "time_loss": 0.000388}
[03/28 16:00:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5980", "eta": "0:04:25", "loss": 0.061402, "lr": 0.039811, "mode": "train", "time_backward": 1.080264, "time_data": 0.017134, "time_diff": 1.572629, "time_forward": 0.458439, "time_loss": 0.000377}
[03/28 16:00:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "5990", "eta": "0:04:04", "loss": 0.062058, "lr": 0.039827, "mode": "train", "time_backward": 1.113237, "time_data": 0.022159, "time_diff": 1.569873, "time_forward": 0.431706, "time_loss": 0.000343}
[03/28 16:00:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6000", "eta": "0:03:44", "loss": 0.066467, "lr": 0.039844, "mode": "train", "time_backward": 1.062781, "time_data": 0.017467, "time_diff": 1.582909, "time_forward": 0.449432, "time_loss": 0.000399}
[03/28 16:01:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6010", "eta": "0:03:24", "loss": 0.062808, "lr": 0.039860, "mode": "train", "time_backward": 1.185589, "time_data": 0.022702, "time_diff": 1.662346, "time_forward": 0.432909, "time_loss": 0.000244}
[03/28 16:01:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6020", "eta": "0:03:04", "loss": 0.063161, "lr": 0.039876, "mode": "train", "time_backward": 1.181255, "time_data": 0.016964, "time_diff": 1.927151, "time_forward": 0.674713, "time_loss": 0.018580}
[03/28 16:01:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6030", "eta": "0:02:44", "loss": 0.063828, "lr": 0.039893, "mode": "train", "time_backward": 1.071330, "time_data": 0.017121, "time_diff": 1.494533, "time_forward": 0.399330, "time_loss": 0.000292}
[03/28 16:01:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6040", "eta": "0:02:23", "loss": 0.067247, "lr": 0.039909, "mode": "train", "time_backward": 1.094060, "time_data": 0.021546, "time_diff": 1.554197, "time_forward": 0.414424, "time_loss": 0.000249}
[03/28 16:02:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6050", "eta": "0:02:02", "loss": 0.068148, "lr": 0.039925, "mode": "train", "time_backward": 1.106250, "time_data": 0.027078, "time_diff": 1.563407, "time_forward": 0.415184, "time_loss": 0.000275}
[03/28 16:02:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6060", "eta": "0:01:42", "loss": 0.061588, "lr": 0.039942, "mode": "train", "time_backward": 1.072747, "time_data": 0.029199, "time_diff": 1.540930, "time_forward": 0.437560, "time_loss": 0.000426}
[03/28 16:02:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6070", "eta": "0:01:22", "loss": 0.062054, "lr": 0.039958, "mode": "train", "time_backward": 1.124293, "time_data": 0.017730, "time_diff": 1.560495, "time_forward": 0.399078, "time_loss": 0.000254}
[03/28 16:02:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6080", "eta": "0:01:02", "loss": 0.065757, "lr": 0.039974, "mode": "train", "time_backward": 1.059893, "time_data": 0.035854, "time_diff": 1.508274, "time_forward": 0.401725, "time_loss": 0.000438}
[03/28 16:03:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6090", "eta": "0:00:42", "loss": 0.063162, "lr": 0.039991, "mode": "train", "time_backward": 1.062902, "time_data": 0.017358, "time_diff": 1.484431, "time_forward": 0.400624, "time_loss": 0.000511}
[03/28 16:03:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6100", "eta": "0:00:22", "loss": 0.065184, "lr": 0.040007, "mode": "train", "time_backward": 1.051954, "time_data": 0.016538, "time_diff": 1.472756, "time_forward": 0.396734, "time_loss": 0.000219}
[03/28 16:03:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "4", "cur_iter": "6110", "eta": "0:00:02", "loss": 0.064890, "lr": 0.040023, "mode": "train", "time_backward": 1.052083, "time_data": 0.016584, "time_diff": 1.473229, "time_forward": 0.397862, "time_loss": 0.000203}
[03/28 16:14:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "10", "eta": "3:23:30", "loss": 0.063693, "lr": 0.040040, "mode": "train", "time_backward": 1.088467, "time_data": 0.016724, "time_diff": 1.543713, "time_forward": 0.399054, "time_loss": 0.000276}
[03/28 16:14:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "20", "eta": "3:20:41", "loss": 0.066651, "lr": 0.040056, "mode": "train", "time_backward": 1.056591, "time_data": 0.028171, "time_diff": 1.509802, "time_forward": 0.421417, "time_loss": 0.000414}
[03/28 16:15:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "30", "eta": "3:20:23", "loss": 0.059651, "lr": 0.040072, "mode": "train", "time_backward": 1.100955, "time_data": 0.016928, "time_diff": 1.610042, "time_forward": 0.400702, "time_loss": 0.000400}
[03/28 16:15:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "40", "eta": "3:20:03", "loss": 0.060935, "lr": 0.040089, "mode": "train", "time_backward": 1.089894, "time_data": 0.017566, "time_diff": 1.574622, "time_forward": 0.427517, "time_loss": 0.000445}
[03/28 16:16:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "50", "eta": "3:22:02", "loss": 0.058319, "lr": 0.040105, "mode": "train", "time_backward": 12.473359, "time_data": 0.017758, "time_diff": 12.899378, "time_forward": 0.402052, "time_loss": 0.000746}
[03/28 16:16:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "60", "eta": "3:21:42", "loss": 0.058245, "lr": 0.040121, "mode": "train", "time_backward": 1.125722, "time_data": 0.016843, "time_diff": 1.572711, "time_forward": 0.399562, "time_loss": 0.000249}
[03/28 16:16:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "70", "eta": "3:21:23", "loss": 0.060038, "lr": 0.040138, "mode": "train", "time_backward": 1.103404, "time_data": 0.018249, "time_diff": 1.554069, "time_forward": 0.425816, "time_loss": 0.003746}
[03/28 16:16:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "80", "eta": "3:21:04", "loss": 0.060772, "lr": 0.040154, "mode": "train", "time_backward": 1.059261, "time_data": 0.024832, "time_diff": 1.566266, "time_forward": 0.418310, "time_loss": 0.000364}
[03/28 16:17:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "90", "eta": "3:20:44", "loss": 0.064307, "lr": 0.040170, "mode": "train", "time_backward": 1.073393, "time_data": 0.029336, "time_diff": 1.525976, "time_forward": 0.410918, "time_loss": 0.000351}
[03/28 16:17:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "100", "eta": "3:20:24", "loss": 0.059531, "lr": 0.040187, "mode": "train", "time_backward": 1.090240, "time_data": 0.020435, "time_diff": 1.524125, "time_forward": 0.399493, "time_loss": 0.000281}
[03/28 16:17:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "110", "eta": "3:20:05", "loss": 0.058595, "lr": 0.040203, "mode": "train", "time_backward": 1.090617, "time_data": 0.017887, "time_diff": 1.519726, "time_forward": 0.399032, "time_loss": 0.000390}
[03/28 16:18:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "120", "eta": "3:19:45", "loss": 0.058416, "lr": 0.040219, "mode": "train", "time_backward": 1.126198, "time_data": 0.017615, "time_diff": 1.590545, "time_forward": 0.399086, "time_loss": 0.000352}
[03/28 16:18:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "130", "eta": "3:19:24", "loss": 0.059379, "lr": 0.040236, "mode": "train", "time_backward": 1.058616, "time_data": 0.016792, "time_diff": 1.481136, "time_forward": 0.398781, "time_loss": 0.000280}
[03/28 16:18:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "140", "eta": "3:19:08", "loss": 0.059592, "lr": 0.040252, "mode": "train", "time_backward": 1.169008, "time_data": 0.017494, "time_diff": 1.849089, "time_forward": 0.656182, "time_loss": 0.000373}
[03/28 16:19:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "150", "eta": "3:18:55", "loss": 0.061784, "lr": 0.040268, "mode": "train", "time_backward": 1.056258, "time_data": 0.016866, "time_diff": 2.008843, "time_forward": 0.932059, "time_loss": 0.000312}
[03/28 16:20:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "160", "eta": "3:18:41", "loss": 0.061446, "lr": 0.040285, "mode": "train", "time_backward": 1.170729, "time_data": 0.030036, "time_diff": 2.034771, "time_forward": 0.777387, "time_loss": 0.005574}
[03/28 16:20:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "170", "eta": "3:18:20", "loss": 0.063358, "lr": 0.040301, "mode": "train", "time_backward": 1.056299, "time_data": 0.017679, "time_diff": 1.482349, "time_forward": 0.404807, "time_loss": 0.000345}
[03/28 16:20:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "180", "eta": "3:18:00", "loss": 0.061822, "lr": 0.040317, "mode": "train", "time_backward": 1.056236, "time_data": 0.019203, "time_diff": 1.482029, "time_forward": 0.401458, "time_loss": 0.000264}
[03/28 16:21:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "190", "eta": "3:18:56", "loss": 0.058615, "lr": 0.040334, "mode": "train", "time_backward": 8.098172, "time_data": 0.017034, "time_diff": 8.525578, "time_forward": 0.398939, "time_loss": 0.000255}
[03/28 16:21:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "200", "eta": "3:18:36", "loss": 0.059858, "lr": 0.040350, "mode": "train", "time_backward": 1.065535, "time_data": 0.041475, "time_diff": 1.532222, "time_forward": 0.399120, "time_loss": 0.000262}
[03/28 16:21:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "210", "eta": "3:18:15", "loss": 0.060120, "lr": 0.040366, "mode": "train", "time_backward": 1.080711, "time_data": 0.025109, "time_diff": 1.512991, "time_forward": 0.400344, "time_loss": 0.000522}
[03/28 16:22:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "220", "eta": "3:17:56", "loss": 0.065289, "lr": 0.040383, "mode": "train", "time_backward": 1.160269, "time_data": 0.016828, "time_diff": 1.606616, "time_forward": 0.399327, "time_loss": 0.000266}
[03/28 16:22:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "230", "eta": "3:21:41", "loss": 0.063808, "lr": 0.040399, "mode": "train", "time_backward": 21.836691, "time_data": 0.017276, "time_diff": 22.297467, "time_forward": 0.397659, "time_loss": 0.000297}
[03/28 16:23:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "240", "eta": "3:10:58", "loss": 0.057766, "lr": 0.040415, "mode": "train", "time_backward": 1.105591, "time_data": 0.017216, "time_diff": 1.533164, "time_forward": 0.400949, "time_loss": 0.000313}
[03/28 16:23:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "250", "eta": "3:10:38", "loss": 0.057759, "lr": 0.040432, "mode": "train", "time_backward": 1.056916, "time_data": 0.017674, "time_diff": 1.484844, "time_forward": 0.398912, "time_loss": 0.000324}
[03/28 16:24:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "260", "eta": "3:10:17", "loss": 0.060017, "lr": 0.040448, "mode": "train", "time_backward": 1.058985, "time_data": 0.017732, "time_diff": 1.483753, "time_forward": 0.401256, "time_loss": 0.000679}
[03/28 16:24:43] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "270", "eta": "3:09:58", "loss": 0.063694, "lr": 0.040464, "mode": "train", "time_backward": 1.067562, "time_data": 0.018638, "time_diff": 1.493400, "time_forward": 0.404806, "time_loss": 0.000235}
[03/28 16:24:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "280", "eta": "3:09:35", "loss": 0.059205, "lr": 0.040480, "mode": "train", "time_backward": 1.069547, "time_data": 0.020499, "time_diff": 1.503168, "time_forward": 0.405508, "time_loss": 0.000374}
[03/28 16:25:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "290", "eta": "3:09:15", "loss": 0.065396, "lr": 0.040497, "mode": "train", "time_backward": 1.055933, "time_data": 0.017338, "time_diff": 1.478616, "time_forward": 0.399749, "time_loss": 0.000248}
[03/28 16:25:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "300", "eta": "3:08:56", "loss": 0.060596, "lr": 0.040513, "mode": "train", "time_backward": 1.058936, "time_data": 0.017579, "time_diff": 1.535554, "time_forward": 0.427380, "time_loss": 0.000274}
[03/28 16:25:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "310", "eta": "3:08:36", "loss": 0.062351, "lr": 0.040529, "mode": "train", "time_backward": 1.065559, "time_data": 0.020194, "time_diff": 1.493010, "time_forward": 0.399998, "time_loss": 0.000365}
[03/28 16:26:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "320", "eta": "3:08:16", "loss": 0.059080, "lr": 0.040546, "mode": "train", "time_backward": 1.055832, "time_data": 0.017596, "time_diff": 1.495299, "time_forward": 0.417601, "time_loss": 0.000244}
[03/28 16:26:44] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "330", "eta": "3:07:57", "loss": 0.057544, "lr": 0.040562, "mode": "train", "time_backward": 1.105378, "time_data": 0.020467, "time_diff": 1.532586, "time_forward": 0.402546, "time_loss": 0.000251}
[03/28 16:26:59] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "340", "eta": "3:07:38", "loss": 0.061475, "lr": 0.040578, "mode": "train", "time_backward": 1.066401, "time_data": 0.020196, "time_diff": 1.500791, "time_forward": 0.404944, "time_loss": 0.000238}
[03/28 16:27:15] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "350", "eta": "3:07:19", "loss": 0.058770, "lr": 0.040595, "mode": "train", "time_backward": 1.081816, "time_data": 0.017864, "time_diff": 1.600732, "time_forward": 0.398501, "time_loss": 0.000235}
[03/28 16:27:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "360", "eta": "3:07:01", "loss": 0.060139, "lr": 0.040611, "mode": "train", "time_backward": 1.058660, "time_data": 0.017306, "time_diff": 1.573992, "time_forward": 0.416945, "time_loss": 0.000544}
[03/28 16:27:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "370", "eta": "3:06:41", "loss": 0.055820, "lr": 0.040627, "mode": "train", "time_backward": 1.126705, "time_data": 0.017730, "time_diff": 1.552506, "time_forward": 0.401126, "time_loss": 0.000246}
[03/28 16:28:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "380", "eta": "3:06:22", "loss": 0.062505, "lr": 0.040644, "mode": "train", "time_backward": 1.068627, "time_data": 0.017122, "time_diff": 1.490534, "time_forward": 0.399241, "time_loss": 0.000245}
[03/28 16:28:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "390", "eta": "3:06:58", "loss": 0.063382, "lr": 0.040660, "mode": "train", "time_backward": 2.538972, "time_data": 0.041147, "time_diff": 6.378493, "time_forward": 3.698742, "time_loss": 0.095427}
[03/28 16:28:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "400", "eta": "3:06:43", "loss": 0.064511, "lr": 0.040676, "mode": "train", "time_backward": 1.104369, "time_data": 0.017508, "time_diff": 1.815823, "time_forward": 0.692122, "time_loss": 0.000309}
[03/28 16:28:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "410", "eta": "3:06:22", "loss": 0.062137, "lr": 0.040693, "mode": "train", "time_backward": 1.080499, "time_data": 0.018224, "time_diff": 1.505545, "time_forward": 0.403204, "time_loss": 0.000335}
[03/28 16:29:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "420", "eta": "3:06:05", "loss": 0.061880, "lr": 0.040709, "mode": "train", "time_backward": 1.130640, "time_data": 0.017903, "time_diff": 1.810841, "time_forward": 0.652517, "time_loss": 0.001275}
[03/28 16:29:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "430", "eta": "3:05:42", "loss": 0.058798, "lr": 0.040725, "mode": "train", "time_backward": 1.057557, "time_data": 0.018246, "time_diff": 1.491785, "time_forward": 0.400415, "time_loss": 0.000307}
[03/28 16:29:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "440", "eta": "3:05:28", "loss": 0.060789, "lr": 0.040742, "mode": "train", "time_backward": 1.061622, "time_data": 0.523278, "time_diff": 2.082023, "time_forward": 0.399270, "time_loss": 0.000235}
[03/28 16:30:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "450", "eta": "3:05:08", "loss": 0.057023, "lr": 0.040758, "mode": "train", "time_backward": 1.054954, "time_data": 0.017431, "time_diff": 1.478695, "time_forward": 0.398652, "time_loss": 0.000227}
[03/28 16:30:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "460", "eta": "3:05:11", "loss": 0.057128, "lr": 0.040774, "mode": "train", "time_backward": 3.093218, "time_data": 0.016817, "time_diff": 3.535151, "time_forward": 0.402072, "time_loss": 0.000249}
[03/28 16:30:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "470", "eta": "3:04:52", "loss": 0.067445, "lr": 0.040791, "mode": "train", "time_backward": 1.089129, "time_data": 0.049043, "time_diff": 1.548539, "time_forward": 0.399392, "time_loss": 0.000282}
[03/28 16:31:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "480", "eta": "3:03:45", "loss": 0.054706, "lr": 0.040807, "mode": "train", "time_backward": 1.055451, "time_data": 0.017960, "time_diff": 1.481037, "time_forward": 0.399462, "time_loss": 0.000249}
[03/28 16:31:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "490", "eta": "3:03:26", "loss": 0.063706, "lr": 0.040823, "mode": "train", "time_backward": 1.108070, "time_data": 0.016833, "time_diff": 1.552993, "time_forward": 0.412895, "time_loss": 0.000267}
[03/28 16:31:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "500", "eta": "3:03:08", "loss": 0.060800, "lr": 0.040840, "mode": "train", "time_backward": 1.107439, "time_data": 0.021903, "time_diff": 1.580659, "time_forward": 0.447488, "time_loss": 0.000276}
[03/28 16:32:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "510", "eta": "3:02:48", "loss": 0.058565, "lr": 0.040856, "mode": "train", "time_backward": 1.077427, "time_data": 0.018353, "time_diff": 1.495246, "time_forward": 0.398846, "time_loss": 0.000344}
[03/28 16:32:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "520", "eta": "3:02:49", "loss": 0.059216, "lr": 0.040872, "mode": "train", "time_backward": 2.912469, "time_data": 0.017639, "time_diff": 3.359007, "time_forward": 0.405872, "time_loss": 0.000280}
[03/28 16:32:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "530", "eta": "3:02:32", "loss": 0.064624, "lr": 0.040889, "mode": "train", "time_backward": 1.122219, "time_data": 0.021066, "time_diff": 1.742327, "time_forward": 0.594437, "time_loss": 0.000361}
[03/28 16:33:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "540", "eta": "3:01:20", "loss": 0.059117, "lr": 0.040905, "mode": "train", "time_backward": 1.053968, "time_data": 0.017285, "time_diff": 1.525616, "time_forward": 0.450814, "time_loss": 0.000246}
[03/28 16:33:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "550", "eta": "3:01:02", "loss": 0.060881, "lr": 0.040921, "mode": "train", "time_backward": 1.195316, "time_data": 0.017044, "time_diff": 1.641358, "time_forward": 0.402474, "time_loss": 0.000258}
[03/28 16:33:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "560", "eta": "3:00:43", "loss": 0.061715, "lr": 0.040938, "mode": "train", "time_backward": 1.126553, "time_data": 0.017968, "time_diff": 1.550185, "time_forward": 0.398391, "time_loss": 0.000230}
[03/28 16:33:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "570", "eta": "3:00:27", "loss": 0.060009, "lr": 0.040954, "mode": "train", "time_backward": 1.074710, "time_data": 0.017485, "time_diff": 1.790486, "time_forward": 0.642376, "time_loss": 0.000275}
[03/28 16:34:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "580", "eta": "3:00:07", "loss": 0.058587, "lr": 0.040970, "mode": "train", "time_backward": 1.051872, "time_data": 0.016692, "time_diff": 1.479962, "time_forward": 0.397895, "time_loss": 0.000218}
[03/28 16:34:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "590", "eta": "2:59:49", "loss": 0.063441, "lr": 0.040987, "mode": "train", "time_backward": 1.189503, "time_data": 0.017825, "time_diff": 1.620798, "time_forward": 0.409808, "time_loss": 0.000400}
[03/28 16:34:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "600", "eta": "2:59:29", "loss": 0.059117, "lr": 0.041003, "mode": "train", "time_backward": 1.058701, "time_data": 0.016934, "time_diff": 1.483491, "time_forward": 0.404961, "time_loss": 0.000336}
[03/28 16:35:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "610", "eta": "2:59:10", "loss": 0.058985, "lr": 0.041019, "mode": "train", "time_backward": 1.075174, "time_data": 0.025101, "time_diff": 1.551880, "time_forward": 0.398208, "time_loss": 0.000249}
[03/28 16:35:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "620", "eta": "2:58:51", "loss": 0.059950, "lr": 0.041036, "mode": "train", "time_backward": 1.071906, "time_data": 0.018915, "time_diff": 1.508175, "time_forward": 0.397876, "time_loss": 0.000235}
[03/28 16:35:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "630", "eta": "2:58:31", "loss": 0.054193, "lr": 0.041052, "mode": "train", "time_backward": 1.059842, "time_data": 0.019588, "time_diff": 1.489039, "time_forward": 0.401825, "time_loss": 0.000366}
[03/28 16:35:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "640", "eta": "2:58:12", "loss": 0.061154, "lr": 0.041068, "mode": "train", "time_backward": 1.070108, "time_data": 0.017907, "time_diff": 1.541986, "time_forward": 0.450463, "time_loss": 0.000284}
[03/28 16:36:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "650", "eta": "2:57:53", "loss": 0.063276, "lr": 0.041085, "mode": "train", "time_backward": 1.109285, "time_data": 0.021173, "time_diff": 1.533193, "time_forward": 0.399057, "time_loss": 0.000341}
[03/28 16:36:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "660", "eta": "2:51:47", "loss": 0.058058, "lr": 0.041101, "mode": "train", "time_backward": 1.078789, "time_data": 0.026720, "time_diff": 1.544097, "time_forward": 0.398305, "time_loss": 0.000264}
[03/28 16:36:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "670", "eta": "2:51:27", "loss": 0.063862, "lr": 0.041117, "mode": "train", "time_backward": 1.058459, "time_data": 0.017367, "time_diff": 1.484549, "time_forward": 0.399652, "time_loss": 0.000330}
[03/28 16:37:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "680", "eta": "2:51:07", "loss": 0.061485, "lr": 0.041134, "mode": "train", "time_backward": 1.086161, "time_data": 0.019742, "time_diff": 1.563874, "time_forward": 0.407398, "time_loss": 0.001230}
[03/28 16:37:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "690", "eta": "2:50:53", "loss": 0.066925, "lr": 0.041150, "mode": "train", "time_backward": 1.072973, "time_data": 0.022062, "time_diff": 1.922025, "time_forward": 0.622591, "time_loss": 0.000265}
[03/28 16:37:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "700", "eta": "2:50:35", "loss": 0.060791, "lr": 0.041166, "mode": "train", "time_backward": 1.097011, "time_data": 0.017135, "time_diff": 1.536291, "time_forward": 0.405428, "time_loss": 0.000758}
[03/28 16:37:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "710", "eta": "2:49:30", "loss": 0.053645, "lr": 0.041182, "mode": "train", "time_backward": 1.091240, "time_data": 0.016969, "time_diff": 1.511371, "time_forward": 0.400700, "time_loss": 0.000243}
[03/28 16:38:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "720", "eta": "2:49:12", "loss": 0.057624, "lr": 0.041199, "mode": "train", "time_backward": 1.103726, "time_data": 0.017529, "time_diff": 1.562341, "time_forward": 0.403372, "time_loss": 0.000803}
[03/28 16:38:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "730", "eta": "2:48:43", "loss": 0.062376, "lr": 0.041215, "mode": "train", "time_backward": 1.058096, "time_data": 0.017307, "time_diff": 1.479833, "time_forward": 0.400727, "time_loss": 0.000545}
[03/28 16:39:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "740", "eta": "2:48:24", "loss": 0.059439, "lr": 0.041231, "mode": "train", "time_backward": 1.110271, "time_data": 0.016809, "time_diff": 1.532012, "time_forward": 0.400757, "time_loss": 0.000282}
[03/28 16:39:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "750", "eta": "2:48:06", "loss": 0.058560, "lr": 0.041248, "mode": "train", "time_backward": 1.057681, "time_data": 0.016803, "time_diff": 1.494672, "time_forward": 0.399185, "time_loss": 0.000267}
[03/28 16:39:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "760", "eta": "2:47:47", "loss": 0.062593, "lr": 0.041264, "mode": "train", "time_backward": 1.054773, "time_data": 0.017592, "time_diff": 1.481139, "time_forward": 0.401173, "time_loss": 0.000600}
[03/28 16:40:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "770", "eta": "2:47:29", "loss": 0.058396, "lr": 0.041280, "mode": "train", "time_backward": 1.186541, "time_data": 0.017437, "time_diff": 1.610183, "time_forward": 0.398803, "time_loss": 0.000300}
[03/28 16:40:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "780", "eta": "2:47:11", "loss": 0.060508, "lr": 0.041297, "mode": "train", "time_backward": 1.055078, "time_data": 0.019168, "time_diff": 1.482198, "time_forward": 0.398587, "time_loss": 0.000228}
[03/28 16:41:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "790", "eta": "2:46:52", "loss": 0.060132, "lr": 0.041313, "mode": "train", "time_backward": 1.096905, "time_data": 0.019842, "time_diff": 1.538541, "time_forward": 0.404118, "time_loss": 0.000352}
[03/28 16:41:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "800", "eta": "2:46:33", "loss": 0.058491, "lr": 0.041329, "mode": "train", "time_backward": 1.055444, "time_data": 0.017195, "time_diff": 1.479140, "time_forward": 0.398684, "time_loss": 0.000289}
[03/28 16:41:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "810", "eta": "2:44:40", "loss": 0.064745, "lr": 0.041346, "mode": "train", "time_backward": 1.096844, "time_data": 0.017164, "time_diff": 1.521378, "time_forward": 0.398960, "time_loss": 0.000372}
[03/28 16:42:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "820", "eta": "2:44:22", "loss": 0.059786, "lr": 0.041362, "mode": "train", "time_backward": 1.055812, "time_data": 0.019923, "time_diff": 1.480742, "time_forward": 0.401031, "time_loss": 0.000746}
[03/28 16:42:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "830", "eta": "2:44:03", "loss": 0.059741, "lr": 0.041378, "mode": "train", "time_backward": 1.055708, "time_data": 0.017149, "time_diff": 1.476689, "time_forward": 0.399779, "time_loss": 0.000270}
[03/28 16:42:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "840", "eta": "2:43:45", "loss": 0.054600, "lr": 0.041395, "mode": "train", "time_backward": 1.116181, "time_data": 0.017602, "time_diff": 1.603104, "time_forward": 0.414768, "time_loss": 0.000287}
[03/28 16:43:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "850", "eta": "2:43:25", "loss": 0.060305, "lr": 0.041411, "mode": "train", "time_backward": 1.056851, "time_data": 0.017266, "time_diff": 1.485154, "time_forward": 0.399177, "time_loss": 0.000258}
[03/28 16:43:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "860", "eta": "2:43:07", "loss": 0.061419, "lr": 0.041427, "mode": "train", "time_backward": 1.072282, "time_data": 0.029243, "time_diff": 1.544868, "time_forward": 0.422828, "time_loss": 0.001233}
[03/28 16:43:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "870", "eta": "2:42:48", "loss": 0.060514, "lr": 0.041444, "mode": "train", "time_backward": 1.062145, "time_data": 0.017097, "time_diff": 1.485252, "time_forward": 0.399048, "time_loss": 0.000340}
[03/28 16:44:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "880", "eta": "2:42:30", "loss": 0.062606, "lr": 0.041460, "mode": "train", "time_backward": 1.057936, "time_data": 0.017163, "time_diff": 1.483833, "time_forward": 0.400393, "time_loss": 0.000306}
[03/28 16:44:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "890", "eta": "2:43:52", "loss": 0.063975, "lr": 0.041476, "mode": "train", "time_backward": 10.723676, "time_data": 0.017157, "time_diff": 11.173424, "time_forward": 0.398942, "time_loss": 0.000267}
[03/28 16:45:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "900", "eta": "2:43:33", "loss": 0.058957, "lr": 0.041493, "mode": "train", "time_backward": 1.059914, "time_data": 0.016884, "time_diff": 1.483051, "time_forward": 0.398585, "time_loss": 0.000228}
[03/28 16:45:42] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "910", "eta": "2:43:14", "loss": 0.059160, "lr": 0.041509, "mode": "train", "time_backward": 1.088389, "time_data": 0.020712, "time_diff": 1.517839, "time_forward": 0.404842, "time_loss": 0.000309}
[03/28 16:45:58] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "920", "eta": "2:42:55", "loss": 0.063546, "lr": 0.041525, "mode": "train", "time_backward": 1.177271, "time_data": 0.017157, "time_diff": 1.602041, "time_forward": 0.405678, "time_loss": 0.000363}
[03/28 16:46:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "930", "eta": "2:42:36", "loss": 0.062309, "lr": 0.041542, "mode": "train", "time_backward": 1.100534, "time_data": 0.016912, "time_diff": 1.519427, "time_forward": 0.398335, "time_loss": 0.000271}
[03/28 16:46:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "940", "eta": "2:42:18", "loss": 0.064539, "lr": 0.041558, "mode": "train", "time_backward": 1.060654, "time_data": 0.026635, "time_diff": 1.645413, "time_forward": 0.545147, "time_loss": 0.000672}
[03/28 16:46:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "950", "eta": "2:41:58", "loss": 0.063139, "lr": 0.041574, "mode": "train", "time_backward": 1.169143, "time_data": 0.017029, "time_diff": 1.598870, "time_forward": 0.400704, "time_loss": 0.000252}
[03/28 16:47:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "960", "eta": "2:41:40", "loss": 0.060521, "lr": 0.041591, "mode": "train", "time_backward": 1.076690, "time_data": 0.017897, "time_diff": 1.623223, "time_forward": 0.410564, "time_loss": 0.000281}
[03/28 16:47:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "970", "eta": "2:41:22", "loss": 0.067368, "lr": 0.041607, "mode": "train", "time_backward": 1.064856, "time_data": 0.048374, "time_diff": 1.570951, "time_forward": 0.454933, "time_loss": 0.000362}
[03/28 16:47:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "980", "eta": "2:41:03", "loss": 0.058857, "lr": 0.041623, "mode": "train", "time_backward": 1.057171, "time_data": 0.027345, "time_diff": 1.509962, "time_forward": 0.405723, "time_loss": 0.000238}
[03/28 16:47:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "990", "eta": "2:40:46", "loss": 0.065584, "lr": 0.041640, "mode": "train", "time_backward": 1.215083, "time_data": 0.022816, "time_diff": 1.652867, "time_forward": 0.407734, "time_loss": 0.000226}
[03/28 16:48:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1000", "eta": "2:40:08", "loss": 0.053505, "lr": 0.041656, "mode": "train", "time_backward": 1.072923, "time_data": 0.017286, "time_diff": 1.551106, "time_forward": 0.443793, "time_loss": 0.000347}
[03/28 16:48:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1010", "eta": "2:39:49", "loss": 0.060994, "lr": 0.041672, "mode": "train", "time_backward": 1.091826, "time_data": 0.020964, "time_diff": 1.534440, "time_forward": 0.398852, "time_loss": 0.000332}
[03/28 16:48:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1020", "eta": "2:39:31", "loss": 0.056907, "lr": 0.041689, "mode": "train", "time_backward": 1.085551, "time_data": 0.019383, "time_diff": 1.563496, "time_forward": 0.418387, "time_loss": 0.000318}
[03/28 16:48:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1030", "eta": "2:39:13", "loss": 0.060605, "lr": 0.041705, "mode": "train", "time_backward": 1.116160, "time_data": 0.017113, "time_diff": 1.544123, "time_forward": 0.399441, "time_loss": 0.000320}
[03/28 16:49:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1040", "eta": "2:38:54", "loss": 0.058926, "lr": 0.041721, "mode": "train", "time_backward": 1.059424, "time_data": 0.018941, "time_diff": 1.481715, "time_forward": 0.399966, "time_loss": 0.000266}
[03/28 16:49:23] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1050", "eta": "2:38:36", "loss": 0.057347, "lr": 0.041738, "mode": "train", "time_backward": 1.060457, "time_data": 0.025104, "time_diff": 1.547340, "time_forward": 0.415697, "time_loss": 0.000974}
[03/28 16:49:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1060", "eta": "2:38:44", "loss": 0.058771, "lr": 0.041754, "mode": "train", "time_backward": 3.144612, "time_data": 0.018701, "time_diff": 4.198718, "time_forward": 0.401672, "time_loss": 0.000402}
[03/28 16:50:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1070", "eta": "2:38:25", "loss": 0.057462, "lr": 0.041770, "mode": "train", "time_backward": 1.055932, "time_data": 0.017497, "time_diff": 1.480339, "time_forward": 0.400093, "time_loss": 0.000325}
[03/28 16:50:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1080", "eta": "2:38:05", "loss": 0.060065, "lr": 0.041787, "mode": "train", "time_backward": 1.078802, "time_data": 0.017804, "time_diff": 1.509384, "time_forward": 0.399145, "time_loss": 0.000253}
[03/28 16:50:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1090", "eta": "2:37:47", "loss": 0.059500, "lr": 0.041803, "mode": "train", "time_backward": 1.084687, "time_data": 0.017265, "time_diff": 1.536984, "time_forward": 0.411829, "time_loss": 0.016314}
[03/28 16:51:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1100", "eta": "2:37:29", "loss": 0.059421, "lr": 0.041819, "mode": "train", "time_backward": 1.083744, "time_data": 0.019319, "time_diff": 1.556368, "time_forward": 0.400522, "time_loss": 0.000673}
[03/28 16:51:27] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1110", "eta": "2:37:10", "loss": 0.063561, "lr": 0.041836, "mode": "train", "time_backward": 1.065550, "time_data": 0.021113, "time_diff": 1.495825, "time_forward": 0.401110, "time_loss": 0.000451}
[03/28 16:51:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1120", "eta": "2:36:51", "loss": 0.053948, "lr": 0.041852, "mode": "train", "time_backward": 1.056585, "time_data": 0.016899, "time_diff": 1.480552, "time_forward": 0.398743, "time_loss": 0.000265}
[03/28 16:52:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1130", "eta": "2:36:26", "loss": 0.059802, "lr": 0.041868, "mode": "train", "time_backward": 1.058446, "time_data": 0.016930, "time_diff": 1.486390, "time_forward": 0.401804, "time_loss": 0.000285}
[03/28 16:52:39] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1140", "eta": "2:36:10", "loss": 0.061214, "lr": 0.041884, "mode": "train", "time_backward": 1.158869, "time_data": 0.022081, "time_diff": 1.839208, "time_forward": 0.421318, "time_loss": 0.000371}
[03/28 16:52:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1150", "eta": "2:35:50", "loss": 0.058292, "lr": 0.041901, "mode": "train", "time_backward": 1.055272, "time_data": 0.016882, "time_diff": 1.478924, "time_forward": 0.399765, "time_loss": 0.000325}
[03/28 16:53:24] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1160", "eta": "2:35:32", "loss": 0.056283, "lr": 0.041917, "mode": "train", "time_backward": 1.142077, "time_data": 0.016882, "time_diff": 1.566341, "time_forward": 0.400506, "time_loss": 0.000425}
[03/28 16:53:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1170", "eta": "2:35:13", "loss": 0.063907, "lr": 0.041933, "mode": "train", "time_backward": 1.056515, "time_data": 0.018326, "time_diff": 1.478440, "time_forward": 0.399141, "time_loss": 0.000221}
[03/28 16:54:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1180", "eta": "2:34:54", "loss": 0.062777, "lr": 0.041950, "mode": "train", "time_backward": 1.065043, "time_data": 0.017160, "time_diff": 1.574223, "time_forward": 0.488233, "time_loss": 0.000370}
[03/28 16:54:33] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1190", "eta": "2:34:35", "loss": 0.062985, "lr": 0.041966, "mode": "train", "time_backward": 1.053905, "time_data": 0.017176, "time_diff": 1.476337, "time_forward": 0.396916, "time_loss": 0.000217}
[03/28 16:57:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1200", "eta": "2:34:16", "loss": 0.062071, "lr": 0.041982, "mode": "train", "time_backward": 1.081851, "time_data": 0.024078, "time_diff": 1.562043, "time_forward": 0.451327, "time_loss": 0.000253}
[03/28 16:57:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1210", "eta": "2:33:56", "loss": 0.061884, "lr": 0.041999, "mode": "train", "time_backward": 1.073934, "time_data": 0.020494, "time_diff": 1.590611, "time_forward": 0.490052, "time_loss": 0.001132}
[03/28 16:57:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1220", "eta": "2:33:35", "loss": 0.058890, "lr": 0.042015, "mode": "train", "time_backward": 1.056010, "time_data": 0.017156, "time_diff": 1.480978, "time_forward": 0.399382, "time_loss": 0.000249}
[03/28 16:57:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1230", "eta": "2:33:18", "loss": 0.055166, "lr": 0.042031, "mode": "train", "time_backward": 1.068051, "time_data": 0.020614, "time_diff": 1.704141, "time_forward": 0.426900, "time_loss": 0.000522}
[03/28 16:58:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1240", "eta": "2:33:00", "loss": 0.060221, "lr": 0.042048, "mode": "train", "time_backward": 1.074116, "time_data": 0.018663, "time_diff": 1.537902, "time_forward": 0.444569, "time_loss": 0.000261}
[03/28 16:58:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1250", "eta": "2:32:27", "loss": 0.056741, "lr": 0.042064, "mode": "train", "time_backward": 1.059495, "time_data": 0.017169, "time_diff": 1.479471, "time_forward": 0.399428, "time_loss": 0.000274}
[03/28 16:59:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1260", "eta": "2:32:08", "loss": 0.057841, "lr": 0.042080, "mode": "train", "time_backward": 1.095331, "time_data": 0.017141, "time_diff": 1.511613, "time_forward": 0.398573, "time_loss": 0.000281}
[03/28 17:00:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1270", "eta": "2:31:49", "loss": 0.059364, "lr": 0.042097, "mode": "train", "time_backward": 1.062480, "time_data": 0.017246, "time_diff": 1.486262, "time_forward": 0.400311, "time_loss": 0.000238}
[03/28 17:00:40] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1280", "eta": "2:31:30", "loss": 0.059538, "lr": 0.042113, "mode": "train", "time_backward": 1.058751, "time_data": 0.017354, "time_diff": 1.480818, "time_forward": 0.399627, "time_loss": 0.000276}
[03/28 17:01:01] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1290", "eta": "2:31:11", "loss": 0.060495, "lr": 0.042129, "mode": "train", "time_backward": 1.055865, "time_data": 0.017173, "time_diff": 1.482016, "time_forward": 0.404433, "time_loss": 0.000351}
[03/28 17:01:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1300", "eta": "2:30:58", "loss": 0.061455, "lr": 0.042146, "mode": "train", "time_backward": 1.369302, "time_data": 0.016974, "time_diff": 2.104215, "time_forward": 0.713835, "time_loss": 0.000379}
[03/28 17:01:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1310", "eta": "2:30:39", "loss": 0.057688, "lr": 0.042162, "mode": "train", "time_backward": 1.057298, "time_data": 0.016868, "time_diff": 1.479806, "time_forward": 0.399144, "time_loss": 0.000260}
[03/28 17:02:06] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1320", "eta": "2:30:20", "loss": 0.063571, "lr": 0.042178, "mode": "train", "time_backward": 1.103156, "time_data": 0.032981, "time_diff": 1.538514, "time_forward": 0.401755, "time_loss": 0.000248}
[03/28 17:02:21] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1330", "eta": "2:30:01", "loss": 0.058954, "lr": 0.042195, "mode": "train", "time_backward": 1.056264, "time_data": 0.016843, "time_diff": 1.484796, "time_forward": 0.400280, "time_loss": 0.000378}
[03/28 17:03:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1340", "eta": "2:29:42", "loss": 0.057882, "lr": 0.042211, "mode": "train", "time_backward": 1.118254, "time_data": 0.021216, "time_diff": 1.549651, "time_forward": 0.398606, "time_loss": 0.008357}
[03/28 17:03:26] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1350", "eta": "2:29:23", "loss": 0.057211, "lr": 0.042227, "mode": "train", "time_backward": 1.109458, "time_data": 0.017985, "time_diff": 1.544936, "time_forward": 0.399344, "time_loss": 0.000300}
[03/28 17:03:41] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1360", "eta": "2:29:05", "loss": 0.053507, "lr": 0.042244, "mode": "train", "time_backward": 1.098662, "time_data": 0.017123, "time_diff": 1.571763, "time_forward": 0.400202, "time_loss": 0.035673}
[03/28 17:03:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1370", "eta": "2:28:45", "loss": 0.056669, "lr": 0.042260, "mode": "train", "time_backward": 1.090003, "time_data": 0.018358, "time_diff": 1.516095, "time_forward": 0.401039, "time_loss": 0.000604}
[03/28 17:04:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1380", "eta": "2:28:27", "loss": 0.057129, "lr": 0.042276, "mode": "train", "time_backward": 1.071924, "time_data": 0.017329, "time_diff": 1.551401, "time_forward": 0.402227, "time_loss": 0.000452}
[03/28 17:04:28] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1390", "eta": "2:28:09", "loss": 0.061593, "lr": 0.042293, "mode": "train", "time_backward": 1.125363, "time_data": 0.020337, "time_diff": 1.551222, "time_forward": 0.398360, "time_loss": 0.000228}
[03/28 17:04:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1400", "eta": "2:27:51", "loss": 0.060823, "lr": 0.042309, "mode": "train", "time_backward": 1.216716, "time_data": 0.017083, "time_diff": 1.645832, "time_forward": 0.399901, "time_loss": 0.000269}
[03/28 17:05:07] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1410", "eta": "2:27:32", "loss": 0.065266, "lr": 0.042325, "mode": "train", "time_backward": 1.065150, "time_data": 0.017528, "time_diff": 1.538325, "time_forward": 0.415800, "time_loss": 0.000370}
[03/28 17:05:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1420", "eta": "2:27:13", "loss": 0.066124, "lr": 0.042342, "mode": "train", "time_backward": 1.062115, "time_data": 0.017557, "time_diff": 1.499524, "time_forward": 0.398797, "time_loss": 0.000330}
[03/28 17:05:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1430", "eta": "2:26:55", "loss": 0.055859, "lr": 0.042358, "mode": "train", "time_backward": 1.095488, "time_data": 0.020086, "time_diff": 1.528913, "time_forward": 0.401963, "time_loss": 0.000322}
[03/28 17:06:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1440", "eta": "2:26:35", "loss": 0.057800, "lr": 0.042374, "mode": "train", "time_backward": 1.054102, "time_data": 0.021749, "time_diff": 1.503060, "time_forward": 0.419549, "time_loss": 0.000361}
[03/28 17:06:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1450", "eta": "2:26:16", "loss": 0.054461, "lr": 0.042391, "mode": "train", "time_backward": 1.059020, "time_data": 0.016704, "time_diff": 1.494365, "time_forward": 0.397878, "time_loss": 0.000226}
[03/28 17:06:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1460", "eta": "2:25:56", "loss": 0.064283, "lr": 0.042407, "mode": "train", "time_backward": 1.088636, "time_data": 0.017261, "time_diff": 1.517931, "time_forward": 0.400905, "time_loss": 0.000662}
[03/28 17:06:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1470", "eta": "2:25:36", "loss": 0.062873, "lr": 0.042423, "mode": "train", "time_backward": 1.055233, "time_data": 0.020729, "time_diff": 1.495225, "time_forward": 0.415669, "time_loss": 0.000394}
[03/28 17:07:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1480", "eta": "2:25:18", "loss": 0.050115, "lr": 0.042440, "mode": "train", "time_backward": 1.055990, "time_data": 0.016900, "time_diff": 1.482716, "time_forward": 0.398675, "time_loss": 0.000569}
[03/28 17:07:38] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1490", "eta": "2:24:58", "loss": 0.059664, "lr": 0.042456, "mode": "train", "time_backward": 1.058356, "time_data": 0.017476, "time_diff": 1.478635, "time_forward": 0.399263, "time_loss": 0.000423}
[03/28 17:07:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1500", "eta": "2:24:39", "loss": 0.058862, "lr": 0.042472, "mode": "train", "time_backward": 1.057360, "time_data": 0.024082, "time_diff": 1.541058, "time_forward": 0.457584, "time_loss": 0.000248}
[03/28 17:08:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1510", "eta": "2:24:21", "loss": 0.056702, "lr": 0.042489, "mode": "train", "time_backward": 1.082580, "time_data": 0.018324, "time_diff": 1.510757, "time_forward": 0.401421, "time_loss": 0.000325}
[03/28 17:08:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1520", "eta": "2:25:33", "loss": 0.057017, "lr": 0.042505, "mode": "train", "time_backward": 11.021573, "time_data": 0.017807, "time_diff": 11.447342, "time_forward": 0.399881, "time_loss": 0.000472}
[03/28 17:09:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1530", "eta": "2:25:14", "loss": 0.059926, "lr": 0.042521, "mode": "train", "time_backward": 1.058819, "time_data": 0.020982, "time_diff": 1.531769, "time_forward": 0.446435, "time_loss": 0.000321}
[03/28 17:09:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1540", "eta": "2:24:54", "loss": 0.061890, "lr": 0.042538, "mode": "train", "time_backward": 1.071572, "time_data": 0.018592, "time_diff": 1.502902, "time_forward": 0.404665, "time_loss": 0.000226}
[03/28 17:09:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1550", "eta": "2:24:58", "loss": 0.059488, "lr": 0.042554, "mode": "train", "time_backward": 3.620640, "time_data": 0.017216, "time_diff": 4.061196, "time_forward": 0.399490, "time_loss": 0.000225}
[03/28 17:10:12] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1560", "eta": "2:25:32", "loss": 0.061160, "lr": 0.042570, "mode": "train", "time_backward": 1.145057, "time_data": 5.452982, "time_diff": 7.361867, "time_forward": 0.678291, "time_loss": 0.000657}
[03/28 17:10:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1570", "eta": "2:25:12", "loss": 0.060270, "lr": 0.042587, "mode": "train", "time_backward": 1.054453, "time_data": 0.017228, "time_diff": 1.544712, "time_forward": 0.469198, "time_loss": 0.000385}
[03/28 17:10:51] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1580", "eta": "2:24:52", "loss": 0.054175, "lr": 0.042603, "mode": "train", "time_backward": 1.095289, "time_data": 0.017436, "time_diff": 1.517674, "time_forward": 0.401741, "time_loss": 0.000325}
[03/28 17:11:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1590", "eta": "2:25:00", "loss": 0.059353, "lr": 0.042619, "mode": "train", "time_backward": 4.246753, "time_data": 0.016907, "time_diff": 4.666176, "time_forward": 0.398960, "time_loss": 0.000265}
[03/28 17:11:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1600", "eta": "2:24:41", "loss": 0.055935, "lr": 0.042635, "mode": "train", "time_backward": 1.055259, "time_data": 0.016982, "time_diff": 1.483814, "time_forward": 0.399132, "time_loss": 0.000264}
[03/28 17:12:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1610", "eta": "2:24:38", "loss": 0.060840, "lr": 0.042652, "mode": "train", "time_backward": 2.693558, "time_data": 0.019102, "time_diff": 3.445258, "time_forward": 0.403592, "time_loss": 0.000277}
[03/28 17:12:52] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1620", "eta": "2:24:19", "loss": 0.055043, "lr": 0.042668, "mode": "train", "time_backward": 1.163374, "time_data": 0.021016, "time_diff": 1.588728, "time_forward": 0.399166, "time_loss": 0.000283}
[03/28 17:13:09] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1630", "eta": "2:23:59", "loss": 0.060847, "lr": 0.042684, "mode": "train", "time_backward": 1.079046, "time_data": 0.017094, "time_diff": 1.507657, "time_forward": 0.407928, "time_loss": 0.000375}
[03/28 17:13:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1640", "eta": "2:23:40", "loss": 0.055131, "lr": 0.042701, "mode": "train", "time_backward": 1.062191, "time_data": 0.019295, "time_diff": 1.513713, "time_forward": 0.404282, "time_loss": 0.000267}
[03/28 17:13:53] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1650", "eta": "2:23:20", "loss": 0.056053, "lr": 0.042717, "mode": "train", "time_backward": 1.057397, "time_data": 0.022811, "time_diff": 1.482727, "time_forward": 0.399098, "time_loss": 0.000263}
[03/28 17:14:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1660", "eta": "2:23:00", "loss": 0.056776, "lr": 0.042733, "mode": "train", "time_backward": 1.056121, "time_data": 0.017081, "time_diff": 1.477149, "time_forward": 0.398201, "time_loss": 0.000209}
[03/28 17:14:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1670", "eta": "2:22:38", "loss": 0.061437, "lr": 0.042750, "mode": "train", "time_backward": 1.071498, "time_data": 0.019128, "time_diff": 1.515372, "time_forward": 0.407551, "time_loss": 0.012836}
[03/28 17:14:50] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1680", "eta": "2:22:19", "loss": 0.060870, "lr": 0.042766, "mode": "train", "time_backward": 1.072641, "time_data": 0.017369, "time_diff": 1.581141, "time_forward": 0.403286, "time_loss": 0.000456}
[03/28 17:15:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1690", "eta": "2:21:59", "loss": 0.060738, "lr": 0.042782, "mode": "train", "time_backward": 1.066672, "time_data": 0.017411, "time_diff": 1.489132, "time_forward": 0.398183, "time_loss": 0.000256}
[03/28 17:15:30] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1700", "eta": "2:21:40", "loss": 0.059860, "lr": 0.042799, "mode": "train", "time_backward": 1.058186, "time_data": 0.019417, "time_diff": 1.483433, "time_forward": 0.401386, "time_loss": 0.000716}
[03/28 17:15:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1710", "eta": "2:21:20", "loss": 0.059767, "lr": 0.042815, "mode": "train", "time_backward": 1.112478, "time_data": 0.017610, "time_diff": 1.538548, "time_forward": 0.400482, "time_loss": 0.000686}
[03/28 17:16:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1720", "eta": "2:21:01", "loss": 0.058307, "lr": 0.042831, "mode": "train", "time_backward": 1.059446, "time_data": 0.017422, "time_diff": 1.483956, "time_forward": 0.403595, "time_loss": 0.000326}
[03/28 17:16:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1730", "eta": "2:20:42", "loss": 0.064424, "lr": 0.042848, "mode": "train", "time_backward": 1.058012, "time_data": 0.017336, "time_diff": 1.528516, "time_forward": 0.449453, "time_loss": 0.000425}
[03/28 17:16:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1740", "eta": "2:20:23", "loss": 0.060770, "lr": 0.042864, "mode": "train", "time_backward": 1.093767, "time_data": 0.017373, "time_diff": 1.563867, "time_forward": 0.445364, "time_loss": 0.001256}
[03/28 17:17:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1750", "eta": "2:20:04", "loss": 0.060550, "lr": 0.042880, "mode": "train", "time_backward": 1.073600, "time_data": 0.017206, "time_diff": 1.496836, "time_forward": 0.399945, "time_loss": 0.000276}
[03/28 17:17:46] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1760", "eta": "2:19:44", "loss": 0.058948, "lr": 0.042897, "mode": "train", "time_backward": 1.108200, "time_data": 0.017829, "time_diff": 1.534509, "time_forward": 0.401993, "time_loss": 0.000684}
[03/28 17:18:17] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1770", "eta": "2:21:36", "loss": 0.059857, "lr": 0.042913, "mode": "train", "time_backward": 1.099644, "time_data": 15.091725, "time_diff": 16.648790, "time_forward": 0.453507, "time_loss": 0.000685}
[03/28 17:18:32] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1780", "eta": "2:21:17", "loss": 0.063767, "lr": 0.042929, "mode": "train", "time_backward": 1.098246, "time_data": 0.017842, "time_diff": 1.518776, "time_forward": 0.399058, "time_loss": 0.000250}
[03/28 17:18:49] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1790", "eta": "2:20:57", "loss": 0.057683, "lr": 0.042946, "mode": "train", "time_backward": 1.054428, "time_data": 0.017664, "time_diff": 1.483721, "time_forward": 0.408063, "time_loss": 0.000267}
[03/28 17:19:05] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1800", "eta": "2:20:39", "loss": 0.056004, "lr": 0.042962, "mode": "train", "time_backward": 1.180754, "time_data": 0.016871, "time_diff": 1.666601, "time_forward": 0.468419, "time_loss": 0.000272}
[03/28 17:19:20] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1810", "eta": "2:20:12", "loss": 0.057027, "lr": 0.042978, "mode": "train", "time_backward": 1.056423, "time_data": 0.050867, "time_diff": 1.509085, "time_forward": 0.398628, "time_loss": 0.000253}
[03/28 17:19:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1820", "eta": "2:19:53", "loss": 0.064495, "lr": 0.042995, "mode": "train", "time_backward": 1.056488, "time_data": 0.024118, "time_diff": 1.532742, "time_forward": 0.408543, "time_loss": 0.000275}
[03/28 17:20:00] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1830", "eta": "2:19:33", "loss": 0.059503, "lr": 0.043011, "mode": "train", "time_backward": 1.068941, "time_data": 0.016865, "time_diff": 1.521494, "time_forward": 0.416818, "time_loss": 0.000294}
[03/28 17:20:16] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1840", "eta": "2:19:13", "loss": 0.057996, "lr": 0.043027, "mode": "train", "time_backward": 1.135010, "time_data": 0.017104, "time_diff": 1.598162, "time_forward": 0.398978, "time_loss": 0.000312}
[03/28 17:20:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1850", "eta": "2:18:53", "loss": 0.050797, "lr": 0.043044, "mode": "train", "time_backward": 1.051988, "time_data": 0.019290, "time_diff": 1.476742, "time_forward": 0.401636, "time_loss": 0.000264}
[03/28 17:20:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1860", "eta": "2:18:34", "loss": 0.060445, "lr": 0.043060, "mode": "train", "time_backward": 1.063201, "time_data": 0.016946, "time_diff": 1.484771, "time_forward": 0.400689, "time_loss": 0.000421}
[03/28 17:21:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1870", "eta": "2:18:14", "loss": 0.060424, "lr": 0.043076, "mode": "train", "time_backward": 1.073716, "time_data": 0.017003, "time_diff": 1.545769, "time_forward": 0.443788, "time_loss": 0.006365}
[03/28 17:21:45] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1880", "eta": "2:16:30", "loss": 0.058079, "lr": 0.043093, "mode": "train", "time_backward": 1.107706, "time_data": 0.017038, "time_diff": 1.530263, "time_forward": 0.400105, "time_loss": 0.000345}
[03/28 17:22:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1890", "eta": "2:16:11", "loss": 0.058077, "lr": 0.043109, "mode": "train", "time_backward": 1.099264, "time_data": 0.017376, "time_diff": 1.516328, "time_forward": 0.399180, "time_loss": 0.000242}
[03/28 17:22:34] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1900", "eta": "2:15:50", "loss": 0.062720, "lr": 0.043125, "mode": "train", "time_backward": 1.056740, "time_data": 0.017362, "time_diff": 1.480686, "time_forward": 0.398633, "time_loss": 0.000314}
[03/28 17:23:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1910", "eta": "2:15:32", "loss": 0.060269, "lr": 0.043142, "mode": "train", "time_backward": 1.057769, "time_data": 0.016967, "time_diff": 1.624795, "time_forward": 0.494021, "time_loss": 0.000277}
[03/28 17:23:48] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1920", "eta": "2:15:13", "loss": 0.061690, "lr": 0.043158, "mode": "train", "time_backward": 1.056831, "time_data": 0.017013, "time_diff": 1.479805, "time_forward": 0.400041, "time_loss": 0.000454}
[03/28 17:24:22] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1930", "eta": "2:14:54", "loss": 0.060731, "lr": 0.043174, "mode": "train", "time_backward": 1.113709, "time_data": 0.018606, "time_diff": 1.540030, "time_forward": 0.400725, "time_loss": 0.000260}
[03/28 17:24:37] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1940", "eta": "2:14:35", "loss": 0.057693, "lr": 0.043191, "mode": "train", "time_backward": 1.163820, "time_data": 0.029123, "time_diff": 1.622470, "time_forward": 0.421795, "time_loss": 0.000282}
[03/28 17:25:04] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1950", "eta": "2:14:06", "loss": 0.059021, "lr": 0.043207, "mode": "train", "time_backward": 1.063782, "time_data": 0.020134, "time_diff": 1.509248, "time_forward": 0.399643, "time_loss": 0.000254}
[03/28 17:25:19] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1960", "eta": "2:13:46", "loss": 0.054592, "lr": 0.043223, "mode": "train", "time_backward": 1.054530, "time_data": 0.017071, "time_diff": 1.476790, "time_forward": 0.397584, "time_loss": 0.000221}
[03/28 17:26:03] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1970", "eta": "2:15:51", "loss": 0.062956, "lr": 0.043240, "mode": "train", "time_backward": 18.675357, "time_data": 0.016946, "time_diff": 19.093378, "time_forward": 0.397608, "time_loss": 0.000292}
[03/28 17:26:36] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1980", "eta": "2:15:31", "loss": 0.057846, "lr": 0.043256, "mode": "train", "time_backward": 1.066568, "time_data": 0.016900, "time_diff": 1.495090, "time_forward": 0.398880, "time_loss": 0.000273}
[03/28 17:26:55] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "1990", "eta": "2:12:34", "loss": 0.054133, "lr": 0.043272, "mode": "train", "time_backward": 1.092193, "time_data": 0.022897, "time_diff": 1.537327, "time_forward": 0.400190, "time_loss": 0.000391}
[03/28 17:27:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2000", "eta": "2:12:16", "loss": 0.060827, "lr": 0.043289, "mode": "train", "time_backward": 1.082905, "time_data": 0.018925, "time_diff": 1.821281, "time_forward": 0.715569, "time_loss": 0.000605}
[03/28 17:27:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2010", "eta": "2:12:30", "loss": 0.057767, "lr": 0.043305, "mode": "train", "time_backward": 5.239832, "time_data": 0.022573, "time_diff": 5.667977, "time_forward": 0.399135, "time_loss": 0.000251}
[03/28 17:28:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2020", "eta": "2:12:11", "loss": 0.057888, "lr": 0.043321, "mode": "train", "time_backward": 1.057073, "time_data": 0.017373, "time_diff": 1.481628, "time_forward": 0.403553, "time_loss": 0.000491}
[03/28 17:28:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2030", "eta": "2:11:51", "loss": 0.057140, "lr": 0.043337, "mode": "train", "time_backward": 1.100587, "time_data": 0.016714, "time_diff": 1.523407, "time_forward": 0.399538, "time_loss": 0.000334}
[03/28 17:29:02] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2040", "eta": "2:11:32", "loss": 0.053013, "lr": 0.043354, "mode": "train", "time_backward": 1.150490, "time_data": 0.017518, "time_diff": 1.579497, "time_forward": 0.402508, "time_loss": 0.000313}
[03/28 17:29:18] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2050", "eta": "2:11:12", "loss": 0.054267, "lr": 0.043370, "mode": "train", "time_backward": 1.055885, "time_data": 0.018249, "time_diff": 1.478650, "time_forward": 0.400611, "time_loss": 0.000684}
[03/28 17:29:56] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2060", "eta": "2:10:53", "loss": 0.055660, "lr": 0.043386, "mode": "train", "time_backward": 1.065032, "time_data": 0.037009, "time_diff": 1.587320, "time_forward": 0.470565, "time_loss": 0.000297}
[03/28 17:30:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2070", "eta": "2:10:34", "loss": 0.058711, "lr": 0.043403, "mode": "train", "time_backward": 1.054949, "time_data": 0.016675, "time_diff": 1.475767, "time_forward": 0.398590, "time_loss": 0.000263}
[03/28 17:30:47] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2080", "eta": "2:10:16", "loss": 0.059303, "lr": 0.043419, "mode": "train", "time_backward": 1.105960, "time_data": 0.018055, "time_diff": 1.618317, "time_forward": 0.431271, "time_loss": 0.000256}
[03/28 17:31:14] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2090", "eta": "2:11:12", "loss": 0.064037, "lr": 0.043435, "mode": "train", "time_backward": 10.640812, "time_data": 0.017092, "time_diff": 11.076156, "time_forward": 0.401132, "time_loss": 0.000943}
[03/28 17:31:31] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2100", "eta": "2:11:10", "loss": 0.062169, "lr": 0.043452, "mode": "train", "time_backward": 3.215830, "time_data": 0.017294, "time_diff": 3.636506, "time_forward": 0.399996, "time_loss": 0.000240}
[03/28 17:31:57] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2110", "eta": "2:10:50", "loss": 0.058431, "lr": 0.043468, "mode": "train", "time_backward": 1.063115, "time_data": 0.017137, "time_diff": 1.487557, "time_forward": 0.398871, "time_loss": 0.000266}
[03/28 17:32:13] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2120", "eta": "2:10:24", "loss": 0.063620, "lr": 0.043484, "mode": "train", "time_backward": 1.058087, "time_data": 0.018024, "time_diff": 1.523375, "time_forward": 0.399536, "time_loss": 0.000359}
[03/28 17:32:29] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2130", "eta": "2:10:05", "loss": 0.061237, "lr": 0.043501, "mode": "train", "time_backward": 1.120780, "time_data": 0.024629, "time_diff": 1.572247, "time_forward": 0.405821, "time_loss": 0.000273}
[03/28 17:32:54] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2140", "eta": "2:09:42", "loss": 0.063446, "lr": 0.043517, "mode": "train", "time_backward": 1.057308, "time_data": 0.016879, "time_diff": 1.478939, "time_forward": 0.400171, "time_loss": 0.000346}
[03/28 17:33:10] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2150", "eta": "2:09:22", "loss": 0.057618, "lr": 0.043533, "mode": "train", "time_backward": 1.075359, "time_data": 0.016894, "time_diff": 1.511239, "time_forward": 0.411163, "time_loss": 0.000267}
[03/28 17:33:25] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2160", "eta": "2:08:59", "loss": 0.052811, "lr": 0.043550, "mode": "train", "time_backward": 1.055598, "time_data": 0.016896, "time_diff": 1.476751, "time_forward": 0.399703, "time_loss": 0.000297}
[03/28 17:34:11] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2170", "eta": "2:08:38", "loss": 0.059543, "lr": 0.043566, "mode": "train", "time_backward": 1.059720, "time_data": 0.016879, "time_diff": 1.486106, "time_forward": 0.398153, "time_loss": 0.000272}
[03/28 17:34:35] pa.utils.logging INFO: json_stats: {"_type": "train_iter", "cur_epoch": "5", "cur_iter": "2180", "eta": "2:09:24", "loss": 0.063249, "lr": 0.043582, "mode": "train", "time_backward": 9.451243, "
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment