基于PaddleSeg的自动驾驶道路分割模型

车道线分割是图像分割领域的热点问题,在自动驾驶领域有着重要应用。

本次项目根据要求训练一个基于飞桨PaddleSeg的自动驾驶道路分割模型,采用实时语义分割网络BiSeNet,使用多个经预处理的训练集进行训练,并在训练结束后从多角度对模型进行评估与可视化,最后保存导出模型,以备科研实训下一阶段使用。

  • 选择main.ipynb进入此项目

  • 请务必选择2022_0723_Release3版本打开项目。

  • 请务必选择GPU版本运行此项目

  • 以下为完整的模型训练过程,逐段执行即可完成数据集准备、模型训练、模型评估与可视化、模型保存导出全过程。

  • 所有代码均可直接按顺序运行,由于AI Studio自身的技术原因,部分代码底部有红色波浪线,它们同样可以正常运行

  • 在模型可视化部分,我们提供了发布前最后一次训练的可视化图表,如果您希望生成查看自己训练的模型的可视化结果,可以在左侧点击可视化按钮,并按照指引选择logdir与自己的模型文件,即可生成当次训练评估的模型可视化图表

山东大学数据科学与人工智能实验班

小组成员:陈其轩 王启帆 何金原 高梓又

参考资料:

PaddleSeg实战——人像分割(最新版) https://aistudio.baidu.com/aistudio/projectdetail/2189481

轻量级实时语义分割经典BiSeNet及其进化BiSeNet v2 https://zhuanlan.zhihu.com/p/141692672

BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation

1.PaddleSeg安装与环境配置

PaddleSeg是基于飞桨PaddlePaddle开发的端到端图像分割开发套件,涵盖了高精度和轻量级等不同方向的大量高质量分割模型。通过模块化的设计,提供了配置化驱动和API调用两种应用方式,帮助开发者更便捷地完成从训练到部署的全流程图像分割应用。
#使用git下载paddleseg 
# github地址
#! git clone https://github.com/PaddlePaddle/PaddleSeg

# gitee地址,推荐
! git clone https://gitee.com/paddlepaddle/PaddleSeg.git
正克隆到 'PaddleSeg'...
remote: Enumerating objects: 18055, done.[K
remote: Counting objects: 100% (3018/3018), done.[K
remote: Compressing objects: 100% (1534/1534), done.[K
remote: Total 18055 (delta 1698), reused 2505 (delta 1435), pack-reused 15037[K
接收对象中: 100% (18055/18055), 341.84 MiB | 11.88 MiB/s, 完成.
处理 delta 中: 100% (11562/11562), 完成.
检查连接... 完成。
#安装环境
!pip install -r PaddleSeg/requirements.txt

#运行验证 为节省时间可跳过
#! python train.py --config configs/quick_start/bisenet_optic_disc_512x512_1k.yml
Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple
Requirement already satisfied: pyyaml>=5.1 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from -r PaddleSeg/requirements.txt (line 1)) (5.1.2)
Requirement already satisfied: visualdl>=2.0.0 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from -r PaddleSeg/requirements.txt (line 2)) (2.3.0)
Requirement already satisfied: opencv-python in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from -r PaddleSeg/requirements.txt (line 3)) (4.1.1.26)
Requirement already satisfied: tqdm in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from -r PaddleSeg/requirements.txt (line 4)) (4.27.0)
Requirement already satisfied: filelock in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from -r PaddleSeg/requirements.txt (line 5)) (3.0.12)
Requirement already satisfied: scipy in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from -r PaddleSeg/requirements.txt (line 6)) (1.6.3)
Requirement already satisfied: prettytable in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from -r PaddleSeg/requirements.txt (line 7)) (0.7.2)
Requirement already satisfied: sklearn in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from -r PaddleSeg/requirements.txt (line 8)) (0.0)
Requirement already satisfied: numpy in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (1.19.5)
Requirement already satisfied: matplotlib in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (2.2.3)
Requirement already satisfied: flask>=1.1.1 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (1.1.1)
Requirement already satisfied: Flask-Babel>=1.0.0 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (1.0.0)
Requirement already satisfied: protobuf>=3.11.0 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (3.20.0)
Requirement already satisfied: pandas in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (1.1.5)
Requirement already satisfied: requests in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (2.24.0)
Requirement already satisfied: six>=1.14.0 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (1.16.0)
Requirement already satisfied: Pillow>=7.0.0 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (8.2.0)
Requirement already satisfied: bce-python-sdk in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (0.8.53)
Requirement already satisfied: scikit-learn in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from sklearn->-r PaddleSeg/requirements.txt (line 8)) (0.24.2)
Requirement already satisfied: click>=5.1 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from flask>=1.1.1->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (7.0)
Requirement already satisfied: Jinja2>=2.10.1 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from flask>=1.1.1->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (3.0.0)
Requirement already satisfied: itsdangerous>=0.24 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from flask>=1.1.1->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (1.1.0)
Requirement already satisfied: Werkzeug>=0.15 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from flask>=1.1.1->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (0.16.0)
Requirement already satisfied: Babel>=2.3 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from Flask-Babel>=1.0.0->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (2.8.0)
Requirement already satisfied: pytz in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from Flask-Babel>=1.0.0->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (2019.3)
Requirement already satisfied: pycryptodome>=3.8.0 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from bce-python-sdk->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (3.9.9)
Requirement already satisfied: future>=0.6.0 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from bce-python-sdk->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (0.18.0)
Requirement already satisfied: pyparsing!=2.0.4,!=2.1.2,!=2.1.6,>=2.0.1 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from matplotlib->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (3.0.9)
Requirement already satisfied: cycler>=0.10 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from matplotlib->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (0.10.0)
Requirement already satisfied: python-dateutil>=2.1 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from matplotlib->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (2.8.2)
Requirement already satisfied: kiwisolver>=1.0.1 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from matplotlib->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (1.1.0)
Requirement already satisfied: urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from requests->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (1.25.6)
Requirement already satisfied: idna<3,>=2.5 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from requests->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (2.8)
Requirement already satisfied: certifi>=2017.4.17 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from requests->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (2019.9.11)
Requirement already satisfied: chardet<4,>=3.0.2 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from requests->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (3.0.4)
Requirement already satisfied: joblib>=0.11 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from scikit-learn->sklearn->-r PaddleSeg/requirements.txt (line 8)) (0.14.1)
Requirement already satisfied: threadpoolctl>=2.0.0 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from scikit-learn->sklearn->-r PaddleSeg/requirements.txt (line 8)) (2.1.0)
Requirement already satisfied: MarkupSafe>=2.0.0rc2 in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from Jinja2>=2.10.1->flask>=1.1.1->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (2.0.1)
Requirement already satisfied: setuptools in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (from kiwisolver>=1.0.1->matplotlib->visualdl>=2.0.0->-r PaddleSeg/requirements.txt (line 2)) (56.2.0)

2.数据集准备

2.1数据集选取

为满足尽可能多使用数据集的作业要求,本项目共挂载4个数据集。由于AI Studio对挂载数据集数量的限制(最多可挂载2个数据集),我们将其中三个数据进行处理制作为一个大型数据集集合并进行挂载。所用到的数据集分别为:

车道线检测-初赛https://aistudio.baidu.com/aistudio/datasetdetail/54076

车道线检测数据集https://aistudio.baidu.com/aistudio/datasetdetail/68698

2020中国华录杯·数据湖算法大赛—定向算法赛(车道线识别)https://aistudio.baidu.com/aistudio/datasetdetail/54289

智能车2022baseline数据集 https://aistudio.baidu.com/aistudio/datasetdetail/125507

2.2数据集结构整理

在训练自定义数据集前,我们将数据集转化为以下结构:

custom_dataset
    |
    |--images
    |  |--image1.jpg
    |  |--image2.jpg
    |  |--...
    |
    |--labels
    |  |--label1.jpg
    |  |--label2.png
    |  |--...
    |
    |--train.txt
    |
    |--val.txt
    |
    |--test.txt

其中train.txt和val.txt的内容如下:

images/image1.jpg labels/label1.png
images/image2.jpg labels/label2.png
...

2.3yml文件修改与参数调整

在完成上述操作后,我们需要将自定义数据集的信息写入yml文件

应修改的文件地址:/home/aistudio/PaddleSeg/configs/quick_start/bisenet_optic_disc_512x512_1k.yml

此处我们已经编写好yml文件,使用Linux命令行对原有yml文件进行替换即可

! mv bisenet_optic_disc_512x512_1k.yml /home/aistudio/PaddleSeg/configs/quick_start/

以该yml文件为示例讲解yml文件的编写方法:

batch_size: 4  #一次训练所抓取的数据样本数量
iters: 20000  #迭代次数

train_dataset: 
  type: Dataset
  dataset_root: /home/aistudio/custom_dataset/dataset2   #train.txt所在目录
  train_path: /home/aistudio/custom_dataset/dataset2/train.txt   #train.txt地址
  num_classes: 20    #分类数量
  transforms:     #图像处理
    - type: Resize
      target_size: [512, 512]  #Resize大小
    - type: RandomHorizontalFlip  #图像增广方法
    - type: Normalize  #正则化
  mode: train

val_dataset: 
  type: Dataset
  dataset_root: /home/aistudio/custom_dataset/dataset2  #val.txt所在目录
  val_path: /home/aistudio/custom_dataset/dataset2/val.txt  #val.txt地址
  num_classes: 20  #分类数量
  transforms:  #图像处理
    - type: Resize
      target_size: [512, 512]  #Resize大小
    - type: Normalize  #正则化
  mode: val

optimizer:  #优化器
  type: sgd  #随机梯度下降
  momentum: 0.9  #momentum动量法
  weight_decay: 4.0e-5  #权重衰减

lr_scheduler:  #学习率调节
  type: PolynomialDecay  #多项式衰减
  learning_rate: 0.01  #学习率
  end_lr: 0
  power: 0.9

loss:  #损失函数
  types:
    - type: CrossEntropyLoss  #交叉熵损失函数
  coef: [1, 1, 1, 1, 1]

model:
  type: BiSeNetV2  #BiSeNet
  pretrained: Null

2.4解压数据集

# 利用Python解压数据集集合
! python /home/aistudio/unzip.py

3.模型训练

3.1模型选择

3.1.1轻量级语义分割

基于轻量化网络模型的设计作为一个热门的研究方法,许多研究者都在运算量、参数量和精度之间寻找平衡,希望使用尽量少的运算量和参数量的同时获得较高的模型精度。目前,轻量级模型主要有SqueezeNet、MobileNet系列和ShuffleNet系列等,这些模型在图像分类领域取得了不错的效果,可以作为基本的主干网络应用于语义分割任务当中。

然而,在语义分割领域,由于需要对输入图片进行逐像素的分类,运算量很大。通常,为了减少语义分割所产生的计算量,通常而言有两种方式:减小图片大小和降低模型复杂度。减小图片大小可以最直接地减少运算量,但是图像会丢失掉大量的细节从而影响精度。降低模型复杂度则会导致模型的特征提取能力减弱,从而影响分割精度。所以,如何在语义分割任务中应用轻量级模型,兼顾实时性和精度性能具有相当大的挑战性。

3.1.2主流加速方式

当前主要有三种加速方法:

  • 通过剪裁或 resize 来限定输入大小,以降低计算复杂度。尽管这种方法简单而有效,空间细节的损失还是让预测打了折扣,尤其是边界部分,导致度量和可视化的精度下降;
  • 通过减少网络通道数量加快处理速度,尤其是在骨干模型的早期阶段,但是这会弱化空间信息。
  • 为追求极其紧凑的框架而丢弃模型的最后阶段(比如ENet)。该方法的缺点也很明显:由于 ENet 抛弃了最后阶段的下采样,模型的感受野不足以涵盖大物体,导致判别能力较差。

这些提速的方法会丢失很多 Spatial Details 或者牺牲 Spatial Capacity,从而导致精度大幅下降。为了弥补空间信息的丢失,有些算法会采用 U-shape 的方式恢复空间信息。但是,U-shape 会降低速度,同时很多丢失的信息并不能简单地通过融合浅层特征来恢复。

3.1.3BiSeNet

BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation中出了一种新的双向分割网络BiSeNet。首先,设计了一个带有小步长的空间路径来保留空间位置信息生成高分辨率的特征图;同时设计了一个带有快速下采样率的语义路径来获取客观的感受野。在这两个模块之上引入一个新的特征融合模块将二者的特征图进行融合,实现速度和精度的平衡

3.2BiSeNet模型训练

# 进行模型训练
! python /home/aistudio/PaddleSeg/train.py --config /home/aistudio/PaddleSeg/configs/quick_start/bisenet_optic_disc_512x512_1k.yml --do_eval \
       --use_vdl --save_interval 4000 --save_dir output 
/home/aistudio/PaddleSeg/paddleseg/models/losses/rmi_loss.py:78: DeprecationWarning: invalid escape sequence \i
  """
2022-07-21 14:55:26 [INFO]	
------------Environment Information-------------
platform: Linux-4.15.0-140-generic-x86_64-with-debian-stretch-sid
Python: 3.7.4 (default, Aug 13 2019, 20:35:49) [GCC 7.3.0]
Paddle compiled with cuda: True
NVCC: Cuda compilation tools, release 10.1, V10.1.243
cudnn: 7.6
GPUs used: 1
CUDA_VISIBLE_DEVICES: None
GPU: ['GPU 0: Tesla V100-SXM2-16GB']
GCC: gcc (Ubuntu 7.5.0-3ubuntu1~16.04) 7.5.0
PaddleSeg: 2.5.0
PaddlePaddle: 2.3.1
OpenCV: 4.1.1
------------------------------------------------
2022-07-21 14:55:27 [INFO]	
---------------Config Information---------------
batch_size: 4
iters: 20000
loss:
  coef:
  - 1
  - 1
  - 1
  - 1
  - 1
  types:
  - ignore_index: 255
    type: CrossEntropyLoss
lr_scheduler:
  end_lr: 0
  learning_rate: 0.01
  power: 0.9
  type: PolynomialDecay
model:
  pretrained: null
  type: BiSeNetV2
optimizer:
  momentum: 0.9
  type: sgd
  weight_decay: 4.0e-05
train_dataset:
  dataset_root: /home/aistudio/custom_dataset/dataset2
  mode: train
  num_classes: 20
  train_path: /home/aistudio/custom_dataset/dataset2/train.txt
  transforms:
  - target_size:
    - 512
    - 512
    type: Resize
  - type: RandomHorizontalFlip
  - type: Normalize
  type: Dataset
val_dataset:
  dataset_root: /home/aistudio/custom_dataset/dataset2
  mode: val
  num_classes: 20
  transforms:
  - target_size:
    - 512
    - 512
    type: Resize
  - type: Normalize
  type: Dataset
  val_path: /home/aistudio/custom_dataset/dataset2/val.txt
------------------------------------------------
W0721 14:55:27.411620  1235 gpu_resources.cc:61] Please NOTE: device: 0, GPU Compute Capability: 7.0, Driver API Version: 11.2, Runtime API Version: 10.1
W0721 14:55:27.411733  1235 gpu_resources.cc:91] device: 0, cuDNN Version: 7.6.
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/paddle/fluid/dygraph/math_op_patch.py:278: UserWarning: The dtype of left and right variables are not the same, left dtype is paddle.float32, but right dtype is paddle.int64, the right dtype will convert to paddle.float32
  format(lhs_dtype, rhs_dtype, lhs_dtype))
2022-07-21 14:55:54 [INFO]	[TRAIN] epoch: 1, iter: 10/20000, loss: 11.9167, lr: 0.009996, batch_cost: 2.2641, reader_cost: 0.77581, ips: 1.7667 samples/sec | ETA 12:34:18
2022-07-21 14:55:57 [INFO]	[TRAIN] epoch: 1, iter: 20/20000, loss: 2.5850, lr: 0.009991, batch_cost: 0.2598, reader_cost: 0.13210, ips: 15.3944 samples/sec | ETA 01:26:31
2022-07-21 14:55:59 [INFO]	[TRAIN] epoch: 1, iter: 30/20000, loss: 1.1348, lr: 0.009987, batch_cost: 0.2198, reader_cost: 0.09766, ips: 18.1970 samples/sec | ETA 01:13:09
2022-07-21 14:56:01 [INFO]	[TRAIN] epoch: 1, iter: 40/20000, loss: 0.7021, lr: 0.009982, batch_cost: 0.2444, reader_cost: 0.11737, ips: 16.3676 samples/sec | ETA 01:21:17
2022-07-21 14:56:04 [INFO]	[TRAIN] epoch: 1, iter: 50/20000, loss: 0.6947, lr: 0.009978, batch_cost: 0.2438, reader_cost: 0.12147, ips: 16.4102 samples/sec | ETA 01:21:02
2022-07-21 14:56:06 [INFO]	[TRAIN] epoch: 1, iter: 60/20000, loss: 0.5881, lr: 0.009973, batch_cost: 0.2378, reader_cost: 0.11496, ips: 16.8229 samples/sec | ETA 01:19:01
2022-07-21 14:56:08 [INFO]	[TRAIN] epoch: 1, iter: 70/20000, loss: 0.8604, lr: 0.009969, batch_cost: 0.2278, reader_cost: 0.10715, ips: 17.5629 samples/sec | ETA 01:15:39
2022-07-21 14:56:11 [INFO]	[TRAIN] epoch: 1, iter: 80/20000, loss: 0.6053, lr: 0.009964, batch_cost: 0.2561, reader_cost: 0.11706, ips: 15.6195 samples/sec | ETA 01:25:01
2022-07-21 14:56:14 [INFO]	[TRAIN] epoch: 1, iter: 90/20000, loss: 0.6464, lr: 0.009960, batch_cost: 0.2548, reader_cost: 0.12605, ips: 15.7008 samples/sec | ETA 01:24:32
2022-07-21 14:56:16 [INFO]	[TRAIN] epoch: 1, iter: 100/20000, loss: 0.5420, lr: 0.009955, batch_cost: 0.2432, reader_cost: 0.11942, ips: 16.4474 samples/sec | ETA 01:20:39
2022-07-21 14:56:19 [INFO]	[TRAIN] epoch: 1, iter: 110/20000, loss: 0.5387, lr: 0.009951, batch_cost: 0.2530, reader_cost: 0.13051, ips: 15.8098 samples/sec | ETA 01:23:52
2022-07-21 14:56:21 [INFO]	[TRAIN] epoch: 1, iter: 120/20000, loss: 0.5950, lr: 0.009946, batch_cost: 0.2387, reader_cost: 0.10965, ips: 16.7573 samples/sec | ETA 01:19:05
2022-07-21 14:56:24 [INFO]	[TRAIN] epoch: 1, iter: 130/20000, loss: 0.6278, lr: 0.009942, batch_cost: 0.2755, reader_cost: 0.14333, ips: 14.5177 samples/sec | ETA 01:31:14
2022-07-21 14:56:26 [INFO]	[TRAIN] epoch: 1, iter: 140/20000, loss: 0.4930, lr: 0.009937, batch_cost: 0.2368, reader_cost: 0.11549, ips: 16.8916 samples/sec | ETA 01:18:22
2022-07-21 14:56:29 [INFO]	[TRAIN] epoch: 1, iter: 150/20000, loss: 0.5909, lr: 0.009933, batch_cost: 0.2500, reader_cost: 0.12266, ips: 16.0025 samples/sec | ETA 01:22:41
2022-07-21 14:56:31 [INFO]	[TRAIN] epoch: 1, iter: 160/20000, loss: 0.5714, lr: 0.009928, batch_cost: 0.2551, reader_cost: 0.13288, ips: 15.6785 samples/sec | ETA 01:24:21
2022-07-21 14:56:34 [INFO]	[TRAIN] epoch: 1, iter: 170/20000, loss: 0.5105, lr: 0.009924, batch_cost: 0.2492, reader_cost: 0.12710, ips: 16.0528 samples/sec | ETA 01:22:21
2022-07-21 14:56:36 [INFO]	[TRAIN] epoch: 1, iter: 180/20000, loss: 0.4768, lr: 0.009919, batch_cost: 0.2407, reader_cost: 0.11817, ips: 16.6198 samples/sec | ETA 01:19:30
2022-07-21 14:56:38 [INFO]	[TRAIN] epoch: 1, iter: 190/20000, loss: 0.5907, lr: 0.009915, batch_cost: 0.2414, reader_cost: 0.11939, ips: 16.5726 samples/sec | ETA 01:19:41
2022-07-21 14:56:41 [INFO]	[TRAIN] epoch: 1, iter: 200/20000, loss: 0.7546, lr: 0.009910, batch_cost: 0.2728, reader_cost: 0.14395, ips: 14.6610 samples/sec | ETA 01:30:02
2022-07-21 14:56:44 [INFO]	[TRAIN] epoch: 1, iter: 210/20000, loss: 0.4847, lr: 0.009906, batch_cost: 0.2655, reader_cost: 0.10178, ips: 15.0678 samples/sec | ETA 01:27:33
2022-07-21 14:56:46 [INFO]	[TRAIN] epoch: 1, iter: 220/20000, loss: 0.5259, lr: 0.009901, batch_cost: 0.2679, reader_cost: 0.14123, ips: 14.9289 samples/sec | ETA 01:28:19
2022-07-21 14:56:49 [INFO]	[TRAIN] epoch: 1, iter: 230/20000, loss: 0.6713, lr: 0.009897, batch_cost: 0.2564, reader_cost: 0.12498, ips: 15.6030 samples/sec | ETA 01:24:28
2022-07-21 14:56:51 [INFO]	[TRAIN] epoch: 1, iter: 240/20000, loss: 0.7099, lr: 0.009892, batch_cost: 0.2456, reader_cost: 0.11903, ips: 16.2841 samples/sec | ETA 01:20:53
2022-07-21 14:56:54 [INFO]	[TRAIN] epoch: 1, iter: 250/20000, loss: 0.5256, lr: 0.009888, batch_cost: 0.3031, reader_cost: 0.16171, ips: 13.1978 samples/sec | ETA 01:39:45
2022-07-21 14:56:57 [INFO]	[TRAIN] epoch: 1, iter: 260/20000, loss: 0.6190, lr: 0.009883, batch_cost: 0.2621, reader_cost: 0.13200, ips: 15.2640 samples/sec | ETA 01:26:12
2022-07-21 14:57:01 [INFO]	[TRAIN] epoch: 1, iter: 270/20000, loss: 0.5735, lr: 0.009879, batch_cost: 0.3585, reader_cost: 0.18691, ips: 11.1586 samples/sec | ETA 01:57:52
2022-07-21 14:57:04 [INFO]	[TRAIN] epoch: 1, iter: 280/20000, loss: 0.3380, lr: 0.009874, batch_cost: 0.2897, reader_cost: 0.15261, ips: 13.8082 samples/sec | ETA 01:35:12
2022-07-21 14:57:06 [INFO]	[TRAIN] epoch: 1, iter: 290/20000, loss: 0.3988, lr: 0.009870, batch_cost: 0.2610, reader_cost: 0.13885, ips: 15.3271 samples/sec | ETA 01:25:43
2022-07-21 14:57:09 [INFO]	[TRAIN] epoch: 1, iter: 300/20000, loss: 0.7116, lr: 0.009865, batch_cost: 0.2562, reader_cost: 0.13377, ips: 15.6124 samples/sec | ETA 01:24:07
2022-07-21 14:57:11 [INFO]	[TRAIN] epoch: 1, iter: 310/20000, loss: 0.7189, lr: 0.009861, batch_cost: 0.2380, reader_cost: 0.11669, ips: 16.8041 samples/sec | ETA 01:18:06
2022-07-21 14:57:14 [INFO]	[TRAIN] epoch: 1, iter: 320/20000, loss: 0.5597, lr: 0.009856, batch_cost: 0.3014, reader_cost: 0.14422, ips: 13.2725 samples/sec | ETA 01:38:51
2022-07-21 14:57:17 [INFO]	[TRAIN] epoch: 1, iter: 330/20000, loss: 0.6489, lr: 0.009852, batch_cost: 0.2453, reader_cost: 0.12232, ips: 16.3059 samples/sec | ETA 01:20:25
2022-07-21 14:57:19 [INFO]	[TRAIN] epoch: 1, iter: 340/20000, loss: 0.5716, lr: 0.009847, batch_cost: 0.2472, reader_cost: 0.12221, ips: 16.1835 samples/sec | ETA 01:20:59
2022-07-21 14:57:21 [INFO]	[TRAIN] epoch: 1, iter: 350/20000, loss: 0.6402, lr: 0.009843, batch_cost: 0.2189, reader_cost: 0.09461, ips: 18.2729 samples/sec | ETA 01:11:41
2022-07-21 14:57:24 [INFO]	[TRAIN] epoch: 1, iter: 360/20000, loss: 0.5413, lr: 0.009838, batch_cost: 0.2359, reader_cost: 0.11353, ips: 16.9545 samples/sec | ETA 01:17:13
2022-07-21 14:57:26 [INFO]	[TRAIN] epoch: 1, iter: 370/20000, loss: 0.5669, lr: 0.009834, batch_cost: 0.2752, reader_cost: 0.14123, ips: 14.5330 samples/sec | ETA 01:30:02
2022-07-21 14:57:29 [INFO]	[TRAIN] epoch: 1, iter: 380/20000, loss: 0.4473, lr: 0.009829, batch_cost: 0.2407, reader_cost: 0.11707, ips: 16.6190 samples/sec | ETA 01:18:42
2022-07-21 14:57:31 [INFO]	[TRAIN] epoch: 1, iter: 390/20000, loss: 0.6943, lr: 0.009825, batch_cost: 0.2233, reader_cost: 0.09761, ips: 17.9126 samples/sec | ETA 01:12:59
2022-07-21 14:57:33 [INFO]	[TRAIN] epoch: 1, iter: 400/20000, loss: 0.5977, lr: 0.009820, batch_cost: 0.2307, reader_cost: 0.10906, ips: 17.3376 samples/sec | ETA 01:15:21
2022-07-21 14:57:36 [INFO]	[TRAIN] epoch: 1, iter: 410/20000, loss: 0.6317, lr: 0.009816, batch_cost: 0.2220, reader_cost: 0.09904, ips: 18.0181 samples/sec | ETA 01:12:28
2022-07-21 14:57:38 [INFO]	[TRAIN] epoch: 1, iter: 420/20000, loss: 0.5135, lr: 0.009811, batch_cost: 0.2344, reader_cost: 0.11267, ips: 17.0646 samples/sec | ETA 01:16:29
2022-07-21 14:57:40 [INFO]	[TRAIN] epoch: 1, iter: 430/20000, loss: 0.6564, lr: 0.009807, batch_cost: 0.2304, reader_cost: 0.10728, ips: 17.3593 samples/sec | ETA 01:15:09
2022-07-21 14:57:43 [INFO]	[TRAIN] epoch: 1, iter: 440/20000, loss: 0.4523, lr: 0.009802, batch_cost: 0.2406, reader_cost: 0.11929, ips: 16.6264 samples/sec | ETA 01:18:25
2022-07-21 14:57:45 [INFO]	[TRAIN] epoch: 1, iter: 450/20000, loss: 0.5845, lr: 0.009798, batch_cost: 0.2446, reader_cost: 0.11998, ips: 16.3550 samples/sec | ETA 01:19:41
2022-07-21 14:57:48 [INFO]	[TRAIN] epoch: 1, iter: 460/20000, loss: 0.5523, lr: 0.009793, batch_cost: 0.2455, reader_cost: 0.11657, ips: 16.2936 samples/sec | ETA 01:19:56
2022-07-21 14:57:50 [INFO]	[TRAIN] epoch: 1, iter: 470/20000, loss: 0.3911, lr: 0.009789, batch_cost: 0.2669, reader_cost: 0.14121, ips: 14.9849 samples/sec | ETA 01:26:53
2022-07-21 14:57:53 [INFO]	[TRAIN] epoch: 1, iter: 480/20000, loss: 0.7399, lr: 0.009784, batch_cost: 0.2524, reader_cost: 0.12457, ips: 15.8508 samples/sec | ETA 01:22:05
2022-07-21 14:57:55 [INFO]	[TRAIN] epoch: 1, iter: 490/20000, loss: 0.5473, lr: 0.009780, batch_cost: 0.2565, reader_cost: 0.13508, ips: 15.5927 samples/sec | ETA 01:23:24
2022-07-21 14:57:58 [INFO]	[TRAIN] epoch: 1, iter: 500/20000, loss: 0.6126, lr: 0.009775, batch_cost: 0.2894, reader_cost: 0.15135, ips: 13.8231 samples/sec | ETA 01:34:02
2022-07-21 14:58:01 [INFO]	[TRAIN] epoch: 1, iter: 510/20000, loss: 0.5861, lr: 0.009771, batch_cost: 0.2424, reader_cost: 0.11570, ips: 16.5007 samples/sec | ETA 01:18:44
2022-07-21 14:58:03 [INFO]	[TRAIN] epoch: 1, iter: 520/20000, loss: 0.3032, lr: 0.009766, batch_cost: 0.2433, reader_cost: 0.11946, ips: 16.4406 samples/sec | ETA 01:18:59
2022-07-21 14:58:06 [INFO]	[TRAIN] epoch: 1, iter: 530/20000, loss: 0.3768, lr: 0.009762, batch_cost: 0.3242, reader_cost: 0.16667, ips: 12.3394 samples/sec | ETA 01:45:11
2022-07-21 14:58:09 [INFO]	[TRAIN] epoch: 1, iter: 540/20000, loss: 0.4011, lr: 0.009757, batch_cost: 0.3133, reader_cost: 0.15552, ips: 12.7685 samples/sec | ETA 01:41:36
2022-07-21 14:58:12 [INFO]	[TRAIN] epoch: 1, iter: 550/20000, loss: 0.5117, lr: 0.009753, batch_cost: 0.2322, reader_cost: 0.11025, ips: 17.2295 samples/sec | ETA 01:15:15
2022-07-21 14:58:14 [INFO]	[TRAIN] epoch: 1, iter: 560/20000, loss: 0.5166, lr: 0.009748, batch_cost: 0.2291, reader_cost: 0.10920, ips: 17.4609 samples/sec | ETA 01:14:13
2022-07-21 14:58:17 [INFO]	[TRAIN] epoch: 1, iter: 570/20000, loss: 0.6538, lr: 0.009744, batch_cost: 0.2840, reader_cost: 0.14188, ips: 14.0847 samples/sec | ETA 01:31:58
2022-07-21 14:58:19 [INFO]	[TRAIN] epoch: 1, iter: 580/20000, loss: 0.6845, lr: 0.009739, batch_cost: 0.2429, reader_cost: 0.11783, ips: 16.4674 samples/sec | ETA 01:18:37
2022-07-21 14:58:22 [INFO]	[TRAIN] epoch: 1, iter: 590/20000, loss: 0.5049, lr: 0.009735, batch_cost: 0.2462, reader_cost: 0.12593, ips: 16.2477 samples/sec | ETA 01:19:38
2022-07-21 14:58:24 [INFO]	[TRAIN] epoch: 1, iter: 600/20000, loss: 0.3684, lr: 0.009730, batch_cost: 0.2445, reader_cost: 0.12355, ips: 16.3579 samples/sec | ETA 01:19:03
2022-07-21 14:58:27 [INFO]	[TRAIN] epoch: 1, iter: 610/20000, loss: 0.7003, lr: 0.009726, batch_cost: 0.2367, reader_cost: 0.11647, ips: 16.9025 samples/sec | ETA 01:16:28
2022-07-21 14:58:29 [INFO]	[TRAIN] epoch: 1, iter: 620/20000, loss: 0.5458, lr: 0.009721, batch_cost: 0.2403, reader_cost: 0.11208, ips: 16.6434 samples/sec | ETA 01:17:37
2022-07-21 14:58:32 [INFO]	[TRAIN] epoch: 1, iter: 630/20000, loss: 0.5631, lr: 0.009716, batch_cost: 0.2693, reader_cost: 0.11775, ips: 14.8525 samples/sec | ETA 01:26:56
2022-07-21 14:58:34 [INFO]	[TRAIN] epoch: 1, iter: 640/20000, loss: 0.3798, lr: 0.009712, batch_cost: 0.2325, reader_cost: 0.11114, ips: 17.2057 samples/sec | ETA 01:15:00
2022-07-21 14:58:36 [INFO]	[TRAIN] epoch: 1, iter: 650/20000, loss: 0.5365, lr: 0.009707, batch_cost: 0.2208, reader_cost: 0.09945, ips: 18.1170 samples/sec | ETA 01:11:12
2022-07-21 14:58:39 [INFO]	[TRAIN] epoch: 1, iter: 660/20000, loss: 0.4280, lr: 0.009703, batch_cost: 0.2356, reader_cost: 0.11444, ips: 16.9788 samples/sec | ETA 01:15:56
2022-07-21 14:58:41 [INFO]	[TRAIN] epoch: 1, iter: 670/20000, loss: 0.4687, lr: 0.009698, batch_cost: 0.2433, reader_cost: 0.11879, ips: 16.4401 samples/sec | ETA 01:18:23
2022-07-21 14:58:43 [INFO]	[TRAIN] epoch: 1, iter: 680/20000, loss: 0.6834, lr: 0.009694, batch_cost: 0.2437, reader_cost: 0.11726, ips: 16.4120 samples/sec | ETA 01:18:28
2022-07-21 14:58:46 [INFO]	[TRAIN] epoch: 1, iter: 690/20000, loss: 0.4881, lr: 0.009689, batch_cost: 0.2473, reader_cost: 0.12316, ips: 16.1777 samples/sec | ETA 01:19:34
2022-07-21 14:58:48 [INFO]	[TRAIN] epoch: 1, iter: 700/20000, loss: 0.6050, lr: 0.009685, batch_cost: 0.2426, reader_cost: 0.11743, ips: 16.4852 samples/sec | ETA 01:18:02
2022-07-21 14:58:51 [INFO]	[TRAIN] epoch: 1, iter: 710/20000, loss: 0.5325, lr: 0.009680, batch_cost: 0.2519, reader_cost: 0.12577, ips: 15.8790 samples/sec | ETA 01:20:59
2022-07-21 14:58:53 [INFO]	[TRAIN] epoch: 1, iter: 720/20000, loss: 0.5232, lr: 0.009676, batch_cost: 0.2441, reader_cost: 0.11157, ips: 16.3834 samples/sec | ETA 01:18:27
2022-07-21 14:58:56 [INFO]	[TRAIN] epoch: 1, iter: 730/20000, loss: 0.5826, lr: 0.009671, batch_cost: 0.2420, reader_cost: 0.11868, ips: 16.5268 samples/sec | ETA 01:17:43
2022-07-21 14:58:58 [INFO]	[TRAIN] epoch: 1, iter: 740/20000, loss: 0.4908, lr: 0.009667, batch_cost: 0.2646, reader_cost: 0.14021, ips: 15.1149 samples/sec | ETA 01:24:56
2022-07-21 14:59:01 [INFO]	[TRAIN] epoch: 1, iter: 750/20000, loss: 0.4985, lr: 0.009662, batch_cost: 0.2810, reader_cost: 0.14470, ips: 14.2354 samples/sec | ETA 01:30:09
2022-07-21 14:59:04 [INFO]	[TRAIN] epoch: 1, iter: 760/20000, loss: 0.6042, lr: 0.009658, batch_cost: 0.2612, reader_cost: 0.13136, ips: 15.3153 samples/sec | ETA 01:23:45
2022-07-21 14:59:06 [INFO]	[TRAIN] epoch: 1, iter: 770/20000, loss: 0.4807, lr: 0.009653, batch_cost: 0.2289, reader_cost: 0.10698, ips: 17.4742 samples/sec | ETA 01:13:21
2022-07-21 14:59:09 [INFO]	[TRAIN] epoch: 1, iter: 780/20000, loss: 0.5735, lr: 0.009649, batch_cost: 0.2435, reader_cost: 0.11190, ips: 16.4280 samples/sec | ETA 01:17:59
2022-07-21 14:59:11 [INFO]	[TRAIN] epoch: 1, iter: 790/20000, loss: 0.6511, lr: 0.009644, batch_cost: 0.2678, reader_cost: 0.13647, ips: 14.9342 samples/sec | ETA 01:25:45
2022-07-21 14:59:15 [INFO]	[TRAIN] epoch: 1, iter: 800/20000, loss: 0.4552, lr: 0.009640, batch_cost: 0.3913, reader_cost: 0.21311, ips: 10.2212 samples/sec | ETA 02:05:13
2022-07-21 14:59:18 [INFO]	[TRAIN] epoch: 1, iter: 810/20000, loss: 0.5345, lr: 0.009635, batch_cost: 0.2880, reader_cost: 0.14126, ips: 13.8907 samples/sec | ETA 01:32:05
2022-07-21 14:59:20 [INFO]	[TRAIN] epoch: 1, iter: 820/20000, loss: 0.5315, lr: 0.009631, batch_cost: 0.2487, reader_cost: 0.12208, ips: 16.0824 samples/sec | ETA 01:19:30
2022-07-21 14:59:23 [INFO]	[TRAIN] epoch: 1, iter: 830/20000, loss: 0.5320, lr: 0.009626, batch_cost: 0.2360, reader_cost: 0.11429, ips: 16.9456 samples/sec | ETA 01:15:25
2022-07-21 14:59:25 [INFO]	[TRAIN] epoch: 1, iter: 840/20000, loss: 0.5432, lr: 0.009622, batch_cost: 0.2216, reader_cost: 0.09770, ips: 18.0506 samples/sec | ETA 01:10:45
2022-07-21 14:59:27 [INFO]	[TRAIN] epoch: 1, iter: 850/20000, loss: 0.3951, lr: 0.009617, batch_cost: 0.2428, reader_cost: 0.12152, ips: 16.4753 samples/sec | ETA 01:17:29
2022-07-21 14:59:30 [INFO]	[TRAIN] epoch: 1, iter: 860/20000, loss: 0.5536, lr: 0.009613, batch_cost: 0.2441, reader_cost: 0.11412, ips: 16.3874 samples/sec | ETA 01:17:51
2022-07-21 14:59:32 [INFO]	[TRAIN] epoch: 1, iter: 870/20000, loss: 0.5246, lr: 0.009608, batch_cost: 0.2398, reader_cost: 0.11729, ips: 16.6810 samples/sec | ETA 01:16:27
2022-07-21 14:59:35 [INFO]	[TRAIN] epoch: 1, iter: 880/20000, loss: 0.6179, lr: 0.009604, batch_cost: 0.2881, reader_cost: 0.14870, ips: 13.8850 samples/sec | ETA 01:31:48
2022-07-21 14:59:37 [INFO]	[TRAIN] epoch: 1, iter: 890/20000, loss: 0.5340, lr: 0.009599, batch_cost: 0.2247, reader_cost: 0.10413, ips: 17.8045 samples/sec | ETA 01:11:33
2022-07-21 14:59:40 [INFO]	[TRAIN] epoch: 1, iter: 900/20000, loss: 0.5292, lr: 0.009595, batch_cost: 0.2436, reader_cost: 0.11777, ips: 16.4173 samples/sec | ETA 01:17:33
2022-07-21 14:59:42 [INFO]	[TRAIN] epoch: 1, iter: 910/20000, loss: 0.4689, lr: 0.009590, batch_cost: 0.2211, reader_cost: 0.09907, ips: 18.0935 samples/sec | ETA 01:10:20
2022-07-21 14:59:45 [INFO]	[TRAIN] epoch: 1, iter: 920/20000, loss: 0.3855, lr: 0.009585, batch_cost: 0.2453, reader_cost: 0.11874, ips: 16.3062 samples/sec | ETA 01:18:00
2022-07-21 14:59:47 [INFO]	[TRAIN] epoch: 1, iter: 930/20000, loss: 0.4187, lr: 0.009581, batch_cost: 0.2500, reader_cost: 0.12157, ips: 16.0022 samples/sec | ETA 01:19:26
2022-07-21 14:59:50 [INFO]	[TRAIN] epoch: 1, iter: 940/20000, loss: 0.4044, lr: 0.009576, batch_cost: 0.2540, reader_cost: 0.12903, ips: 15.7474 samples/sec | ETA 01:20:41
2022-07-21 14:59:52 [INFO]	[TRAIN] epoch: 1, iter: 950/20000, loss: 0.6536, lr: 0.009572, batch_cost: 0.2217, reader_cost: 0.09806, ips: 18.0384 samples/sec | ETA 01:10:24
2022-07-21 14:59:54 [INFO]	[TRAIN] epoch: 1, iter: 960/20000, loss: 0.4353, lr: 0.009567, batch_cost: 0.2398, reader_cost: 0.11553, ips: 16.6803 samples/sec | ETA 01:16:05
2022-07-21 14:59:57 [INFO]	[TRAIN] epoch: 1, iter: 970/20000, loss: 0.4946, lr: 0.009563, batch_cost: 0.2392, reader_cost: 0.11176, ips: 16.7224 samples/sec | ETA 01:15:51
2022-07-21 14:59:59 [INFO]	[TRAIN] epoch: 1, iter: 980/20000, loss: 0.5284, lr: 0.009558, batch_cost: 0.2553, reader_cost: 0.12890, ips: 15.6654 samples/sec | ETA 01:20:56
2022-07-21 15:00:02 [INFO]	[TRAIN] epoch: 1, iter: 990/20000, loss: 0.4630, lr: 0.009554, batch_cost: 0.2495, reader_cost: 0.11552, ips: 16.0318 samples/sec | ETA 01:19:03
2022-07-21 15:00:04 [INFO]	[TRAIN] epoch: 1, iter: 1000/20000, loss: 0.4146, lr: 0.009549, batch_cost: 0.2474, reader_cost: 0.11718, ips: 16.1670 samples/sec | ETA 01:18:20
2022-07-21 15:00:07 [INFO]	[TRAIN] epoch: 1, iter: 1010/20000, loss: 0.4732, lr: 0.009545, batch_cost: 0.2937, reader_cost: 0.15381, ips: 13.6211 samples/sec | ETA 01:32:56
2022-07-21 15:00:09 [INFO]	[TRAIN] epoch: 1, iter: 1020/20000, loss: 0.4508, lr: 0.009540, batch_cost: 0.2254, reader_cost: 0.10488, ips: 17.7483 samples/sec | ETA 01:11:17
2022-07-21 15:00:12 [INFO]	[TRAIN] epoch: 1, iter: 1030/20000, loss: 0.5881, lr: 0.009536, batch_cost: 0.2452, reader_cost: 0.12463, ips: 16.3134 samples/sec | ETA 01:17:31
2022-07-21 15:00:14 [INFO]	[TRAIN] epoch: 1, iter: 1040/20000, loss: 0.5697, lr: 0.009531, batch_cost: 0.2504, reader_cost: 0.12768, ips: 15.9728 samples/sec | ETA 01:19:08
2022-07-21 15:00:17 [INFO]	[TRAIN] epoch: 1, iter: 1050/20000, loss: 0.5305, lr: 0.009527, batch_cost: 0.2431, reader_cost: 0.12030, ips: 16.4538 samples/sec | ETA 01:16:46
2022-07-21 15:00:21 [INFO]	[TRAIN] epoch: 1, iter: 1060/20000, loss: 0.6297, lr: 0.009522, batch_cost: 0.3790, reader_cost: 0.19315, ips: 10.5533 samples/sec | ETA 01:59:38
2022-07-21 15:00:24 [INFO]	[TRAIN] epoch: 1, iter: 1070/20000, loss: 0.4769, lr: 0.009518, batch_cost: 0.3346, reader_cost: 0.16908, ips: 11.9557 samples/sec | ETA 01:45:33
2022-07-21 15:00:26 [INFO]	[TRAIN] epoch: 1, iter: 1080/20000, loss: 0.4034, lr: 0.009513, batch_cost: 0.2279, reader_cost: 0.10339, ips: 17.5551 samples/sec | ETA 01:11:50
2022-07-21 15:00:28 [INFO]	[TRAIN] epoch: 1, iter: 1090/20000, loss: 0.4825, lr: 0.009509, batch_cost: 0.2192, reader_cost: 0.09771, ips: 18.2467 samples/sec | ETA 01:09:05
2022-07-21 15:00:31 [INFO]	[TRAIN] epoch: 1, iter: 1100/20000, loss: 0.3385, lr: 0.009504, batch_cost: 0.2352, reader_cost: 0.11049, ips: 17.0073 samples/sec | ETA 01:14:05
2022-07-21 15:00:33 [INFO]	[TRAIN] epoch: 1, iter: 1110/20000, loss: 0.4106, lr: 0.009500, batch_cost: 0.2282, reader_cost: 0.10406, ips: 17.5322 samples/sec | ETA 01:11:49
2022-07-21 15:00:35 [INFO]	[TRAIN] epoch: 1, iter: 1120/20000, loss: 0.4325, lr: 0.009495, batch_cost: 0.2330, reader_cost: 0.11247, ips: 17.1680 samples/sec | ETA 01:13:18
2022-07-21 15:00:38 [INFO]	[TRAIN] epoch: 1, iter: 1130/20000, loss: 0.5360, lr: 0.009490, batch_cost: 0.2789, reader_cost: 0.14722, ips: 14.3424 samples/sec | ETA 01:27:42
2022-07-21 15:00:41 [INFO]	[TRAIN] epoch: 1, iter: 1140/20000, loss: 0.4442, lr: 0.009486, batch_cost: 0.2589, reader_cost: 0.13046, ips: 15.4507 samples/sec | ETA 01:21:22
2022-07-21 15:00:43 [INFO]	[TRAIN] epoch: 1, iter: 1150/20000, loss: 0.4827, lr: 0.009481, batch_cost: 0.2802, reader_cost: 0.15269, ips: 14.2767 samples/sec | ETA 01:28:01
2022-07-21 15:00:46 [INFO]	[TRAIN] epoch: 1, iter: 1160/20000, loss: 0.4974, lr: 0.009477, batch_cost: 0.2409, reader_cost: 0.11418, ips: 16.6029 samples/sec | ETA 01:15:38
2022-07-21 15:00:48 [INFO]	[TRAIN] epoch: 1, iter: 1170/20000, loss: 0.4769, lr: 0.009472, batch_cost: 0.2298, reader_cost: 0.10773, ips: 17.4065 samples/sec | ETA 01:12:07
2022-07-21 15:00:50 [INFO]	[TRAIN] epoch: 1, iter: 1180/20000, loss: 0.3819, lr: 0.009468, batch_cost: 0.2293, reader_cost: 0.10535, ips: 17.4476 samples/sec | ETA 01:11:54
2022-07-21 15:00:53 [INFO]	[TRAIN] epoch: 1, iter: 1190/20000, loss: 0.6574, lr: 0.009463, batch_cost: 0.2347, reader_cost: 0.11229, ips: 17.0438 samples/sec | ETA 01:13:34
2022-07-21 15:00:55 [INFO]	[TRAIN] epoch: 1, iter: 1200/20000, loss: 0.4916, lr: 0.009459, batch_cost: 0.2399, reader_cost: 0.11714, ips: 16.6766 samples/sec | ETA 01:15:09
2022-07-21 15:00:58 [INFO]	[TRAIN] epoch: 1, iter: 1210/20000, loss: 0.5461, lr: 0.009454, batch_cost: 0.2457, reader_cost: 0.12399, ips: 16.2820 samples/sec | ETA 01:16:56
2022-07-21 15:01:00 [INFO]	[TRAIN] epoch: 1, iter: 1220/20000, loss: 0.4712, lr: 0.009450, batch_cost: 0.2203, reader_cost: 0.09798, ips: 18.1535 samples/sec | ETA 01:08:58
2022-07-21 15:01:02 [INFO]	[TRAIN] epoch: 1, iter: 1230/20000, loss: 0.4156, lr: 0.009445, batch_cost: 0.2464, reader_cost: 0.12192, ips: 16.2358 samples/sec | ETA 01:17:04
2022-07-21 15:01:05 [INFO]	[TRAIN] epoch: 1, iter: 1240/20000, loss: 0.5316, lr: 0.009441, batch_cost: 0.2317, reader_cost: 0.10498, ips: 17.2663 samples/sec | ETA 01:12:26
2022-07-21 15:01:07 [INFO]	[TRAIN] epoch: 1, iter: 1250/20000, loss: 0.4818, lr: 0.009436, batch_cost: 0.2443, reader_cost: 0.11366, ips: 16.3757 samples/sec | ETA 01:16:19
2022-07-21 15:01:10 [INFO]	[TRAIN] epoch: 1, iter: 1260/20000, loss: 0.4698, lr: 0.009432, batch_cost: 0.2878, reader_cost: 0.13636, ips: 13.8999 samples/sec | ETA 01:29:52
2022-07-21 15:01:12 [INFO]	[TRAIN] epoch: 1, iter: 1270/20000, loss: 0.4265, lr: 0.009427, batch_cost: 0.2383, reader_cost: 0.11406, ips: 16.7835 samples/sec | ETA 01:14:23
2022-07-21 15:01:15 [INFO]	[TRAIN] epoch: 1, iter: 1280/20000, loss: 0.5951, lr: 0.009423, batch_cost: 0.2522, reader_cost: 0.12792, ips: 15.8574 samples/sec | ETA 01:18:42
2022-07-21 15:01:17 [INFO]	[TRAIN] epoch: 1, iter: 1290/20000, loss: 0.3429, lr: 0.009418, batch_cost: 0.2255, reader_cost: 0.10364, ips: 17.7360 samples/sec | ETA 01:10:19
2022-07-21 15:01:20 [INFO]	[TRAIN] epoch: 1, iter: 1300/20000, loss: 0.3973, lr: 0.009414, batch_cost: 0.2432, reader_cost: 0.11715, ips: 16.4474 samples/sec | ETA 01:15:47
2022-07-21 15:01:22 [INFO]	[TRAIN] epoch: 1, iter: 1310/20000, loss: 0.4413, lr: 0.009409, batch_cost: 0.2407, reader_cost: 0.11528, ips: 16.6167 samples/sec | ETA 01:14:59
2022-07-21 15:01:25 [INFO]	[TRAIN] epoch: 1, iter: 1320/20000, loss: 0.4925, lr: 0.009404, batch_cost: 0.2979, reader_cost: 0.13995, ips: 13.4276 samples/sec | ETA 01:32:44
2022-07-21 15:01:29 [INFO]	[TRAIN] epoch: 1, iter: 1330/20000, loss: 0.5488, lr: 0.009400, batch_cost: 0.3533, reader_cost: 0.16668, ips: 11.3203 samples/sec | ETA 01:49:56
2022-07-21 15:01:31 [INFO]	[TRAIN] epoch: 1, iter: 1340/20000, loss: 0.4812, lr: 0.009395, batch_cost: 0.2705, reader_cost: 0.13960, ips: 14.7887 samples/sec | ETA 01:24:07
2022-07-21 15:01:34 [INFO]	[TRAIN] epoch: 1, iter: 1350/20000, loss: 0.5172, lr: 0.009391, batch_cost: 0.2422, reader_cost: 0.11986, ips: 16.5181 samples/sec | ETA 01:15:16
2022-07-21 15:01:36 [INFO]	[TRAIN] epoch: 1, iter: 1360/20000, loss: 0.3980, lr: 0.009386, batch_cost: 0.2518, reader_cost: 0.13068, ips: 15.8856 samples/sec | ETA 01:18:13
2022-07-21 15:01:38 [INFO]	[TRAIN] epoch: 1, iter: 1370/20000, loss: 0.3723, lr: 0.009382, batch_cost: 0.2300, reader_cost: 0.10781, ips: 17.3912 samples/sec | ETA 01:11:24
2022-07-21 15:01:41 [INFO]	[TRAIN] epoch: 1, iter: 1380/20000, loss: 0.4541, lr: 0.009377, batch_cost: 0.2837, reader_cost: 0.13631, ips: 14.1019 samples/sec | ETA 01:28:01
2022-07-21 15:01:44 [INFO]	[TRAIN] epoch: 1, iter: 1390/20000, loss: 0.4767, lr: 0.009373, batch_cost: 0.2392, reader_cost: 0.11585, ips: 16.7225 samples/sec | ETA 01:14:11
2022-07-21 15:01:46 [INFO]	[TRAIN] epoch: 1, iter: 1400/20000, loss: 0.3933, lr: 0.009368, batch_cost: 0.2449, reader_cost: 0.11558, ips: 16.3356 samples/sec | ETA 01:15:54
2022-07-21 15:01:48 [INFO]	[TRAIN] epoch: 1, iter: 1410/20000, loss: 0.4505, lr: 0.009364, batch_cost: 0.2324, reader_cost: 0.11003, ips: 17.2099 samples/sec | ETA 01:12:00
2022-07-21 15:01:51 [INFO]	[TRAIN] epoch: 1, iter: 1420/20000, loss: 0.5567, lr: 0.009359, batch_cost: 0.2350, reader_cost: 0.11300, ips: 17.0214 samples/sec | ETA 01:12:46
2022-07-21 15:01:53 [INFO]	[TRAIN] epoch: 1, iter: 1430/20000, loss: 0.5346, lr: 0.009355, batch_cost: 0.2317, reader_cost: 0.11083, ips: 17.2664 samples/sec | ETA 01:11:42
2022-07-21 15:01:56 [INFO]	[TRAIN] epoch: 1, iter: 1440/20000, loss: 0.3221, lr: 0.009350, batch_cost: 0.2409, reader_cost: 0.11926, ips: 16.6020 samples/sec | ETA 01:14:31
2022-07-21 15:01:58 [INFO]	[TRAIN] epoch: 1, iter: 1450/20000, loss: 0.3976, lr: 0.009346, batch_cost: 0.2317, reader_cost: 0.11015, ips: 17.2617 samples/sec | ETA 01:11:38
2022-07-21 15:02:00 [INFO]	[TRAIN] epoch: 1, iter: 1460/20000, loss: 0.4399, lr: 0.009341, batch_cost: 0.2346, reader_cost: 0.10962, ips: 17.0511 samples/sec | ETA 01:12:29
2022-07-21 15:02:03 [INFO]	[TRAIN] epoch: 1, iter: 1470/20000, loss: 0.4112, lr: 0.009336, batch_cost: 0.2401, reader_cost: 0.11930, ips: 16.6624 samples/sec | ETA 01:14:08
2022-07-21 15:02:05 [INFO]	[TRAIN] epoch: 1, iter: 1480/20000, loss: 0.3547, lr: 0.009332, batch_cost: 0.2274, reader_cost: 0.10579, ips: 17.5871 samples/sec | ETA 01:10:12
2022-07-21 15:02:07 [INFO]	[TRAIN] epoch: 1, iter: 1490/20000, loss: 0.4614, lr: 0.009327, batch_cost: 0.2467, reader_cost: 0.11496, ips: 16.2169 samples/sec | ETA 01:16:05
2022-07-21 15:02:10 [INFO]	[TRAIN] epoch: 1, iter: 1500/20000, loss: 0.4686, lr: 0.009323, batch_cost: 0.2370, reader_cost: 0.07827, ips: 16.8744 samples/sec | ETA 01:13:05
2022-07-21 15:02:12 [INFO]	[TRAIN] epoch: 1, iter: 1510/20000, loss: 0.3991, lr: 0.009318, batch_cost: 0.2656, reader_cost: 0.12437, ips: 15.0627 samples/sec | ETA 01:21:50
2022-07-21 15:02:15 [INFO]	[TRAIN] epoch: 1, iter: 1520/20000, loss: 0.5329, lr: 0.009314, batch_cost: 0.2597, reader_cost: 0.12597, ips: 15.4041 samples/sec | ETA 01:19:58
2022-07-21 15:02:17 [INFO]	[TRAIN] epoch: 1, iter: 1530/20000, loss: 0.4557, lr: 0.009309, batch_cost: 0.2351, reader_cost: 0.11179, ips: 17.0172 samples/sec | ETA 01:12:21
2022-07-21 15:02:20 [INFO]	[TRAIN] epoch: 1, iter: 1540/20000, loss: 0.4484, lr: 0.009305, batch_cost: 0.2414, reader_cost: 0.11561, ips: 16.5687 samples/sec | ETA 01:14:16
2022-07-21 15:02:22 [INFO]	[TRAIN] epoch: 1, iter: 1550/20000, loss: 0.4284, lr: 0.009300, batch_cost: 0.2361, reader_cost: 0.11385, ips: 16.9391 samples/sec | ETA 01:12:36
2022-07-21 15:02:25 [INFO]	[TRAIN] epoch: 1, iter: 1560/20000, loss: 0.2956, lr: 0.009296, batch_cost: 0.2505, reader_cost: 0.12430, ips: 15.9669 samples/sec | ETA 01:16:59
2022-07-21 15:02:27 [INFO]	[TRAIN] epoch: 1, iter: 1570/20000, loss: 0.4430, lr: 0.009291, batch_cost: 0.2752, reader_cost: 0.13904, ips: 14.5354 samples/sec | ETA 01:24:31
2022-07-21 15:02:30 [INFO]	[TRAIN] epoch: 1, iter: 1580/20000, loss: 0.4362, lr: 0.009287, batch_cost: 0.2415, reader_cost: 0.11759, ips: 16.5609 samples/sec | ETA 01:14:09
2022-07-21 15:02:32 [INFO]	[TRAIN] epoch: 1, iter: 1590/20000, loss: 0.5054, lr: 0.009282, batch_cost: 0.2598, reader_cost: 0.12305, ips: 15.3962 samples/sec | ETA 01:19:42
2022-07-21 15:02:36 [INFO]	[TRAIN] epoch: 1, iter: 1600/20000, loss: 0.3720, lr: 0.009277, batch_cost: 0.3773, reader_cost: 0.19223, ips: 10.6007 samples/sec | ETA 01:55:42
2022-07-21 15:02:39 [INFO]	[TRAIN] epoch: 1, iter: 1610/20000, loss: 0.5518, lr: 0.009273, batch_cost: 0.2356, reader_cost: 0.11445, ips: 16.9774 samples/sec | ETA 01:12:12
2022-07-21 15:02:41 [INFO]	[TRAIN] epoch: 1, iter: 1620/20000, loss: 0.4688, lr: 0.009268, batch_cost: 0.2354, reader_cost: 0.11329, ips: 16.9929 samples/sec | ETA 01:12:06
2022-07-21 15:02:43 [INFO]	[TRAIN] epoch: 1, iter: 1630/20000, loss: 0.5300, lr: 0.009264, batch_cost: 0.2420, reader_cost: 0.11269, ips: 16.5280 samples/sec | ETA 01:14:05
2022-07-21 15:02:46 [INFO]	[TRAIN] epoch: 1, iter: 1640/20000, loss: 0.5341, lr: 0.009259, batch_cost: 0.2995, reader_cost: 0.15607, ips: 13.3563 samples/sec | ETA 01:31:38
2022-07-21 15:02:49 [INFO]	[TRAIN] epoch: 1, iter: 1650/20000, loss: 0.5649, lr: 0.009255, batch_cost: 0.2306, reader_cost: 0.10422, ips: 17.3478 samples/sec | ETA 01:10:31
2022-07-21 15:02:51 [INFO]	[TRAIN] epoch: 1, iter: 1660/20000, loss: 0.4397, lr: 0.009250, batch_cost: 0.2425, reader_cost: 0.12138, ips: 16.4959 samples/sec | ETA 01:14:07
2022-07-21 15:02:53 [INFO]	[TRAIN] epoch: 1, iter: 1670/20000, loss: 0.4640, lr: 0.009246, batch_cost: 0.2126, reader_cost: 0.08875, ips: 18.8119 samples/sec | ETA 01:04:57
2022-07-21 15:02:55 [INFO]	[TRAIN] epoch: 1, iter: 1680/20000, loss: 0.3646, lr: 0.009241, batch_cost: 0.2170, reader_cost: 0.09464, ips: 18.4297 samples/sec | ETA 01:06:16
2022-07-21 15:02:58 [INFO]	[TRAIN] epoch: 1, iter: 1690/20000, loss: 0.4645, lr: 0.009237, batch_cost: 0.2354, reader_cost: 0.11311, ips: 16.9925 samples/sec | ETA 01:11:50
2022-07-21 15:03:00 [INFO]	[TRAIN] epoch: 1, iter: 1700/20000, loss: 0.5172, lr: 0.009232, batch_cost: 0.2416, reader_cost: 0.11691, ips: 16.5581 samples/sec | ETA 01:13:40
2022-07-21 15:03:02 [INFO]	[TRAIN] epoch: 1, iter: 1710/20000, loss: 0.4305, lr: 0.009228, batch_cost: 0.2286, reader_cost: 0.10740, ips: 17.4983 samples/sec | ETA 01:09:40
2022-07-21 15:03:05 [INFO]	[TRAIN] epoch: 1, iter: 1720/20000, loss: 0.4329, lr: 0.009223, batch_cost: 0.2426, reader_cost: 0.12253, ips: 16.4892 samples/sec | ETA 01:13:54
2022-07-21 15:03:07 [INFO]	[TRAIN] epoch: 1, iter: 1730/20000, loss: 0.5170, lr: 0.009218, batch_cost: 0.2228, reader_cost: 0.10112, ips: 17.9545 samples/sec | ETA 01:07:50
2022-07-21 15:03:09 [INFO]	[TRAIN] epoch: 1, iter: 1740/20000, loss: 0.5246, lr: 0.009214, batch_cost: 0.2341, reader_cost: 0.11167, ips: 17.0862 samples/sec | ETA 01:11:14
2022-07-21 15:03:12 [INFO]	[TRAIN] epoch: 1, iter: 1750/20000, loss: 0.3562, lr: 0.009209, batch_cost: 0.2459, reader_cost: 0.11569, ips: 16.2693 samples/sec | ETA 01:14:46
2022-07-21 15:03:14 [INFO]	[TRAIN] epoch: 1, iter: 1760/20000, loss: 0.5561, lr: 0.009205, batch_cost: 0.2534, reader_cost: 0.13036, ips: 15.7823 samples/sec | ETA 01:17:02
2022-07-21 15:03:18 [INFO]	[TRAIN] epoch: 1, iter: 1770/20000, loss: 0.4291, lr: 0.009200, batch_cost: 0.3164, reader_cost: 0.16638, ips: 12.6435 samples/sec | ETA 01:36:07
2022-07-21 15:03:20 [INFO]	[TRAIN] epoch: 1, iter: 1780/20000, loss: 0.4387, lr: 0.009196, batch_cost: 0.2527, reader_cost: 0.12658, ips: 15.8263 samples/sec | ETA 01:16:45
2022-07-21 15:03:22 [INFO]	[TRAIN] epoch: 1, iter: 1790/20000, loss: 0.4254, lr: 0.009191, batch_cost: 0.2394, reader_cost: 0.11477, ips: 16.7106 samples/sec | ETA 01:12:38
2022-07-21 15:03:25 [INFO]	[TRAIN] epoch: 1, iter: 1800/20000, loss: 0.3958, lr: 0.009187, batch_cost: 0.2333, reader_cost: 0.11257, ips: 17.1445 samples/sec | ETA 01:10:46
2022-07-21 15:03:27 [INFO]	[TRAIN] epoch: 1, iter: 1810/20000, loss: 0.5448, lr: 0.009182, batch_cost: 0.2694, reader_cost: 0.12892, ips: 14.8498 samples/sec | ETA 01:21:39
2022-07-21 15:03:30 [INFO]	[TRAIN] epoch: 1, iter: 1820/20000, loss: 0.3464, lr: 0.009178, batch_cost: 0.2447, reader_cost: 0.11292, ips: 16.3478 samples/sec | ETA 01:14:08
2022-07-21 15:03:32 [INFO]	[TRAIN] epoch: 1, iter: 1830/20000, loss: 0.4049, lr: 0.009173, batch_cost: 0.2338, reader_cost: 0.11191, ips: 17.1071 samples/sec | ETA 01:10:48
2022-07-21 15:03:35 [INFO]	[TRAIN] epoch: 1, iter: 1840/20000, loss: 0.3595, lr: 0.009169, batch_cost: 0.2430, reader_cost: 0.11532, ips: 16.4619 samples/sec | ETA 01:13:32
2022-07-21 15:03:37 [INFO]	[TRAIN] epoch: 1, iter: 1850/20000, loss: 0.3629, lr: 0.009164, batch_cost: 0.2482, reader_cost: 0.12273, ips: 16.1139 samples/sec | ETA 01:15:05
2022-07-21 15:03:40 [INFO]	[TRAIN] epoch: 1, iter: 1860/20000, loss: 0.4569, lr: 0.009159, batch_cost: 0.3002, reader_cost: 0.13980, ips: 13.3247 samples/sec | ETA 01:30:45
2022-07-21 15:03:43 [INFO]	[TRAIN] epoch: 1, iter: 1870/20000, loss: 0.4834, lr: 0.009155, batch_cost: 0.3170, reader_cost: 0.15557, ips: 12.6198 samples/sec | ETA 01:35:46
2022-07-21 15:03:46 [INFO]	[TRAIN] epoch: 1, iter: 1880/20000, loss: 0.4126, lr: 0.009150, batch_cost: 0.2451, reader_cost: 0.12095, ips: 16.3171 samples/sec | ETA 01:14:01
2022-07-21 15:03:48 [INFO]	[TRAIN] epoch: 1, iter: 1890/20000, loss: 0.4030, lr: 0.009146, batch_cost: 0.2472, reader_cost: 0.10706, ips: 16.1790 samples/sec | ETA 01:14:37
2022-07-21 15:03:51 [INFO]	[TRAIN] epoch: 1, iter: 1900/20000, loss: 0.5064, lr: 0.009141, batch_cost: 0.2380, reader_cost: 0.11292, ips: 16.8087 samples/sec | ETA 01:11:47
2022-07-21 15:03:53 [INFO]	[TRAIN] epoch: 1, iter: 1910/20000, loss: 0.5435, lr: 0.009137, batch_cost: 0.2442, reader_cost: 0.11924, ips: 16.3781 samples/sec | ETA 01:13:38
2022-07-21 15:03:55 [INFO]	[TRAIN] epoch: 1, iter: 1920/20000, loss: 0.4150, lr: 0.009132, batch_cost: 0.2287, reader_cost: 0.10521, ips: 17.4898 samples/sec | ETA 01:08:54
2022-07-21 15:03:58 [INFO]	[TRAIN] epoch: 1, iter: 1930/20000, loss: 0.4284, lr: 0.009128, batch_cost: 0.2304, reader_cost: 0.10437, ips: 17.3617 samples/sec | ETA 01:09:23
2022-07-21 15:04:00 [INFO]	[TRAIN] epoch: 1, iter: 1940/20000, loss: 0.3087, lr: 0.009123, batch_cost: 0.2290, reader_cost: 0.10486, ips: 17.4700 samples/sec | ETA 01:08:55
2022-07-21 15:04:02 [INFO]	[TRAIN] epoch: 1, iter: 1950/20000, loss: 0.3878, lr: 0.009119, batch_cost: 0.2342, reader_cost: 0.11147, ips: 17.0770 samples/sec | ETA 01:10:27
2022-07-21 15:04:05 [INFO]	[TRAIN] epoch: 1, iter: 1960/20000, loss: 0.4249, lr: 0.009114, batch_cost: 0.2468, reader_cost: 0.12314, ips: 16.2067 samples/sec | ETA 01:14:12
2022-07-21 15:04:08 [INFO]	[TRAIN] epoch: 1, iter: 1970/20000, loss: 0.3492, lr: 0.009109, batch_cost: 0.2698, reader_cost: 0.14259, ips: 14.8261 samples/sec | ETA 01:21:04
2022-07-21 15:04:10 [INFO]	[TRAIN] epoch: 1, iter: 1980/20000, loss: 0.3650, lr: 0.009105, batch_cost: 0.2613, reader_cost: 0.13450, ips: 15.3055 samples/sec | ETA 01:18:29
2022-07-21 15:04:12 [INFO]	[TRAIN] epoch: 1, iter: 1990/20000, loss: 0.4084, lr: 0.009100, batch_cost: 0.2212, reader_cost: 0.09364, ips: 18.0809 samples/sec | ETA 01:06:24
2022-07-21 15:04:15 [INFO]	[TRAIN] epoch: 1, iter: 2000/20000, loss: 0.6924, lr: 0.009096, batch_cost: 0.2393, reader_cost: 0.10859, ips: 16.7151 samples/sec | ETA 01:11:47
2022-07-21 15:04:17 [INFO]	[TRAIN] epoch: 1, iter: 2010/20000, loss: 0.3879, lr: 0.009091, batch_cost: 0.2466, reader_cost: 0.12393, ips: 16.2214 samples/sec | ETA 01:13:56
2022-07-21 15:04:20 [INFO]	[TRAIN] epoch: 1, iter: 2020/20000, loss: 0.4924, lr: 0.009087, batch_cost: 0.2507, reader_cost: 0.11846, ips: 15.9557 samples/sec | ETA 01:15:07
2022-07-21 15:04:23 [INFO]	[TRAIN] epoch: 1, iter: 2030/20000, loss: 0.6063, lr: 0.009082, batch_cost: 0.2864, reader_cost: 0.12909, ips: 13.9681 samples/sec | ETA 01:25:46
2022-07-21 15:04:25 [INFO]	[TRAIN] epoch: 1, iter: 2040/20000, loss: 0.3620, lr: 0.009078, batch_cost: 0.2553, reader_cost: 0.13009, ips: 15.6662 samples/sec | ETA 01:16:25
2022-07-21 15:04:28 [INFO]	[TRAIN] epoch: 1, iter: 2050/20000, loss: 0.4169, lr: 0.009073, batch_cost: 0.2528, reader_cost: 0.10433, ips: 15.8226 samples/sec | ETA 01:15:37
2022-07-21 15:04:31 [INFO]	[TRAIN] epoch: 1, iter: 2060/20000, loss: 0.4918, lr: 0.009068, batch_cost: 0.2956, reader_cost: 0.14172, ips: 13.5316 samples/sec | ETA 01:28:23
2022-07-21 15:04:33 [INFO]	[TRAIN] epoch: 1, iter: 2070/20000, loss: 0.3849, lr: 0.009064, batch_cost: 0.2323, reader_cost: 0.10801, ips: 17.2201 samples/sec | ETA 01:09:24
2022-07-21 15:04:35 [INFO]	[TRAIN] epoch: 1, iter: 2080/20000, loss: 0.4007, lr: 0.009059, batch_cost: 0.2286, reader_cost: 0.10636, ips: 17.4994 samples/sec | ETA 01:08:16
2022-07-21 15:04:37 [INFO]	[TRAIN] epoch: 1, iter: 2090/20000, loss: 0.5691, lr: 0.009055, batch_cost: 0.2250, reader_cost: 0.10220, ips: 17.7797 samples/sec | ETA 01:07:09
2022-07-21 15:04:40 [INFO]	[TRAIN] epoch: 1, iter: 2100/20000, loss: 0.4965, lr: 0.009050, batch_cost: 0.2605, reader_cost: 0.12591, ips: 15.3531 samples/sec | ETA 01:17:43
2022-07-21 15:04:43 [INFO]	[TRAIN] epoch: 1, iter: 2110/20000, loss: 0.5045, lr: 0.009046, batch_cost: 0.2604, reader_cost: 0.13304, ips: 15.3627 samples/sec | ETA 01:17:38
2022-07-21 15:04:45 [INFO]	[TRAIN] epoch: 1, iter: 2120/20000, loss: 0.6160, lr: 0.009041, batch_cost: 0.2611, reader_cost: 0.12662, ips: 15.3198 samples/sec | ETA 01:17:48
2022-07-21 15:04:50 [INFO]	[TRAIN] epoch: 1, iter: 2130/20000, loss: 0.4363, lr: 0.009037, batch_cost: 0.4411, reader_cost: 0.23197, ips: 9.0675 samples/sec | ETA 02:11:23
2022-07-21 15:04:53 [INFO]	[TRAIN] epoch: 1, iter: 2140/20000, loss: 0.4418, lr: 0.009032, batch_cost: 0.2883, reader_cost: 0.14741, ips: 13.8734 samples/sec | ETA 01:25:49
2022-07-21 15:04:55 [INFO]	[TRAIN] epoch: 1, iter: 2150/20000, loss: 0.3939, lr: 0.009028, batch_cost: 0.2399, reader_cost: 0.11678, ips: 16.6724 samples/sec | ETA 01:11:22
2022-07-21 15:04:58 [INFO]	[TRAIN] epoch: 1, iter: 2160/20000, loss: 0.3488, lr: 0.009023, batch_cost: 0.2571, reader_cost: 0.13286, ips: 15.5570 samples/sec | ETA 01:16:26
2022-07-21 15:05:00 [INFO]	[TRAIN] epoch: 1, iter: 2170/20000, loss: 0.6358, lr: 0.009018, batch_cost: 0.2494, reader_cost: 0.12771, ips: 16.0388 samples/sec | ETA 01:14:06
2022-07-21 15:05:02 [INFO]	[TRAIN] epoch: 1, iter: 2180/20000, loss: 0.6096, lr: 0.009014, batch_cost: 0.2212, reader_cost: 0.09842, ips: 18.0818 samples/sec | ETA 01:05:42
2022-07-21 15:05:05 [INFO]	[TRAIN] epoch: 1, iter: 2190/20000, loss: 0.4214, lr: 0.009009, batch_cost: 0.2344, reader_cost: 0.11323, ips: 17.0634 samples/sec | ETA 01:09:35
2022-07-21 15:05:07 [INFO]	[TRAIN] epoch: 1, iter: 2200/20000, loss: 0.4162, lr: 0.009005, batch_cost: 0.2320, reader_cost: 0.10736, ips: 17.2430 samples/sec | ETA 01:08:49
2022-07-21 15:05:09 [INFO]	[TRAIN] epoch: 1, iter: 2210/20000, loss: 0.3995, lr: 0.009000, batch_cost: 0.2348, reader_cost: 0.10915, ips: 17.0360 samples/sec | ETA 01:09:37
2022-07-21 15:05:12 [INFO]	[TRAIN] epoch: 1, iter: 2220/20000, loss: 0.3374, lr: 0.008996, batch_cost: 0.2410, reader_cost: 0.11705, ips: 16.5965 samples/sec | ETA 01:11:25
2022-07-21 15:05:14 [INFO]	[TRAIN] epoch: 1, iter: 2230/20000, loss: 0.5185, lr: 0.008991, batch_cost: 0.2355, reader_cost: 0.10940, ips: 16.9854 samples/sec | ETA 01:09:44
2022-07-21 15:05:16 [INFO]	[TRAIN] epoch: 1, iter: 2240/20000, loss: 0.4500, lr: 0.008987, batch_cost: 0.2272, reader_cost: 0.10501, ips: 17.6082 samples/sec | ETA 01:07:14
2022-07-21 15:05:19 [INFO]	[TRAIN] epoch: 1, iter: 2250/20000, loss: 0.7274, lr: 0.008982, batch_cost: 0.2196, reader_cost: 0.09743, ips: 18.2177 samples/sec | ETA 01:04:57
2022-07-21 15:05:21 [INFO]	[TRAIN] epoch: 1, iter: 2260/20000, loss: 0.3841, lr: 0.008977, batch_cost: 0.2435, reader_cost: 0.12147, ips: 16.4251 samples/sec | ETA 01:12:00
2022-07-21 15:05:24 [INFO]	[TRAIN] epoch: 1, iter: 2270/20000, loss: 0.3783, lr: 0.008973, batch_cost: 0.2637, reader_cost: 0.12557, ips: 15.1665 samples/sec | ETA 01:17:56
2022-07-21 15:05:26 [INFO]	[TRAIN] epoch: 1, iter: 2280/20000, loss: 0.4149, lr: 0.008968, batch_cost: 0.2567, reader_cost: 0.12468, ips: 15.5815 samples/sec | ETA 01:15:48
2022-07-21 15:05:29 [INFO]	[TRAIN] epoch: 1, iter: 2290/20000, loss: 0.4114, lr: 0.008964, batch_cost: 0.2483, reader_cost: 0.12077, ips: 16.1123 samples/sec | ETA 01:13:16
2022-07-21 15:05:31 [INFO]	[TRAIN] epoch: 1, iter: 2300/20000, loss: 0.3833, lr: 0.008959, batch_cost: 0.2748, reader_cost: 0.09942, ips: 14.5554 samples/sec | ETA 01:21:04
2022-07-21 15:05:34 [INFO]	[TRAIN] epoch: 1, iter: 2310/20000, loss: 0.3883, lr: 0.008955, batch_cost: 0.2737, reader_cost: 0.13252, ips: 14.6148 samples/sec | ETA 01:20:41
2022-07-21 15:05:36 [INFO]	[TRAIN] epoch: 1, iter: 2320/20000, loss: 0.4866, lr: 0.008950, batch_cost: 0.2333, reader_cost: 0.10720, ips: 17.1449 samples/sec | ETA 01:08:44
2022-07-21 15:05:39 [INFO]	[TRAIN] epoch: 1, iter: 2330/20000, loss: 0.5399, lr: 0.008946, batch_cost: 0.2499, reader_cost: 0.12264, ips: 16.0066 samples/sec | ETA 01:13:35
2022-07-21 15:05:41 [INFO]	[TRAIN] epoch: 1, iter: 2340/20000, loss: 0.5577, lr: 0.008941, batch_cost: 0.2511, reader_cost: 0.12407, ips: 15.9286 samples/sec | ETA 01:13:54
2022-07-21 15:05:44 [INFO]	[TRAIN] epoch: 1, iter: 2350/20000, loss: 0.3897, lr: 0.008936, batch_cost: 0.2436, reader_cost: 0.11524, ips: 16.4189 samples/sec | ETA 01:11:39
2022-07-21 15:05:47 [INFO]	[TRAIN] epoch: 1, iter: 2360/20000, loss: 0.4661, lr: 0.008932, batch_cost: 0.2605, reader_cost: 0.13014, ips: 15.3573 samples/sec | ETA 01:16:34
2022-07-21 15:05:49 [INFO]	[TRAIN] epoch: 1, iter: 2370/20000, loss: 0.4750, lr: 0.008927, batch_cost: 0.2347, reader_cost: 0.10929, ips: 17.0454 samples/sec | ETA 01:08:57
2022-07-21 15:05:51 [INFO]	[TRAIN] epoch: 1, iter: 2380/20000, loss: 0.2691, lr: 0.008923, batch_cost: 0.2381, reader_cost: 0.11472, ips: 16.7981 samples/sec | ETA 01:09:55
2022-07-21 15:05:55 [INFO]	[TRAIN] epoch: 1, iter: 2390/20000, loss: 0.3781, lr: 0.008918, batch_cost: 0.3452, reader_cost: 0.16137, ips: 11.5866 samples/sec | ETA 01:41:19
2022-07-21 15:05:58 [INFO]	[TRAIN] epoch: 1, iter: 2400/20000, loss: 0.4223, lr: 0.008914, batch_cost: 0.3499, reader_cost: 0.17834, ips: 11.4321 samples/sec | ETA 01:42:38
2022-07-21 15:06:00 [INFO]	[TRAIN] epoch: 1, iter: 2410/20000, loss: 0.4630, lr: 0.008909, batch_cost: 0.2278, reader_cost: 0.10040, ips: 17.5572 samples/sec | ETA 01:06:47
2022-07-21 15:06:03 [INFO]	[TRAIN] epoch: 1, iter: 2420/20000, loss: 0.5419, lr: 0.008905, batch_cost: 0.2332, reader_cost: 0.10913, ips: 17.1547 samples/sec | ETA 01:08:19
2022-07-21 15:06:05 [INFO]	[TRAIN] epoch: 1, iter: 2430/20000, loss: 0.3899, lr: 0.008900, batch_cost: 0.2256, reader_cost: 0.10341, ips: 17.7276 samples/sec | ETA 01:06:04
2022-07-21 15:06:07 [INFO]	[TRAIN] epoch: 1, iter: 2440/20000, loss: 0.4359, lr: 0.008895, batch_cost: 0.2207, reader_cost: 0.09881, ips: 18.1261 samples/sec | ETA 01:04:35
2022-07-21 15:06:10 [INFO]	[TRAIN] epoch: 1, iter: 2450/20000, loss: 0.4368, lr: 0.008891, batch_cost: 0.2248, reader_cost: 0.10436, ips: 17.7935 samples/sec | ETA 01:05:45
2022-07-21 15:06:12 [INFO]	[TRAIN] epoch: 1, iter: 2460/20000, loss: 0.3899, lr: 0.008886, batch_cost: 0.2440, reader_cost: 0.12158, ips: 16.3905 samples/sec | ETA 01:11:20
2022-07-21 15:06:14 [INFO]	[TRAIN] epoch: 1, iter: 2470/20000, loss: 0.3536, lr: 0.008882, batch_cost: 0.2332, reader_cost: 0.11277, ips: 17.1556 samples/sec | ETA 01:08:07
2022-07-21 15:06:17 [INFO]	[TRAIN] epoch: 1, iter: 2480/20000, loss: 0.4160, lr: 0.008877, batch_cost: 0.2426, reader_cost: 0.11513, ips: 16.4862 samples/sec | ETA 01:10:50
2022-07-21 15:06:19 [INFO]	[TRAIN] epoch: 1, iter: 2490/20000, loss: 0.4458, lr: 0.008873, batch_cost: 0.2346, reader_cost: 0.11269, ips: 17.0521 samples/sec | ETA 01:08:27
2022-07-21 15:06:22 [INFO]	[TRAIN] epoch: 1, iter: 2500/20000, loss: 0.4657, lr: 0.008868, batch_cost: 0.2490, reader_cost: 0.12594, ips: 16.0639 samples/sec | ETA 01:12:37
2022-07-21 15:06:24 [INFO]	[TRAIN] epoch: 1, iter: 2510/20000, loss: 0.3307, lr: 0.008864, batch_cost: 0.2478, reader_cost: 0.12426, ips: 16.1446 samples/sec | ETA 01:12:13
2022-07-21 15:06:27 [INFO]	[TRAIN] epoch: 1, iter: 2520/20000, loss: 0.4111, lr: 0.008859, batch_cost: 0.2522, reader_cost: 0.12310, ips: 15.8605 samples/sec | ETA 01:13:28
2022-07-21 15:06:29 [INFO]	[TRAIN] epoch: 1, iter: 2530/20000, loss: 0.4417, lr: 0.008854, batch_cost: 0.2727, reader_cost: 0.12989, ips: 14.6664 samples/sec | ETA 01:19:24
2022-07-21 15:06:32 [INFO]	[TRAIN] epoch: 1, iter: 2540/20000, loss: 0.5739, lr: 0.008850, batch_cost: 0.2613, reader_cost: 0.13658, ips: 15.3106 samples/sec | ETA 01:16:01
2022-07-21 15:06:35 [INFO]	[TRAIN] epoch: 1, iter: 2550/20000, loss: 0.5732, lr: 0.008845, batch_cost: 0.2929, reader_cost: 0.13852, ips: 13.6581 samples/sec | ETA 01:25:10
2022-07-21 15:06:37 [INFO]	[TRAIN] epoch: 1, iter: 2560/20000, loss: 0.4234, lr: 0.008841, batch_cost: 0.2373, reader_cost: 0.11543, ips: 16.8587 samples/sec | ETA 01:08:57
2022-07-21 15:06:40 [INFO]	[TRAIN] epoch: 1, iter: 2570/20000, loss: 0.4583, lr: 0.008836, batch_cost: 0.2364, reader_cost: 0.11282, ips: 16.9203 samples/sec | ETA 01:08:40
2022-07-21 15:06:42 [INFO]	[TRAIN] epoch: 1, iter: 2580/20000, loss: 0.5129, lr: 0.008832, batch_cost: 0.2552, reader_cost: 0.12716, ips: 15.6763 samples/sec | ETA 01:14:04
2022-07-21 15:06:44 [INFO]	[TRAIN] epoch: 1, iter: 2590/20000, loss: 0.3699, lr: 0.008827, batch_cost: 0.2327, reader_cost: 0.10832, ips: 17.1865 samples/sec | ETA 01:07:32
2022-07-21 15:06:47 [INFO]	[TRAIN] epoch: 1, iter: 2600/20000, loss: 0.3810, lr: 0.008822, batch_cost: 0.2442, reader_cost: 0.11336, ips: 16.3781 samples/sec | ETA 01:10:49
2022-07-21 15:06:49 [INFO]	[TRAIN] epoch: 1, iter: 2610/20000, loss: 0.4032, lr: 0.008818, batch_cost: 0.2366, reader_cost: 0.11340, ips: 16.9030 samples/sec | ETA 01:08:35
2022-07-21 15:06:52 [INFO]	[TRAIN] epoch: 1, iter: 2620/20000, loss: 0.4468, lr: 0.008813, batch_cost: 0.2522, reader_cost: 0.12869, ips: 15.8618 samples/sec | ETA 01:13:02
2022-07-21 15:06:54 [INFO]	[TRAIN] epoch: 1, iter: 2630/20000, loss: 0.4805, lr: 0.008809, batch_cost: 0.2518, reader_cost: 0.12643, ips: 15.8878 samples/sec | ETA 01:12:53
2022-07-21 15:06:57 [INFO]	[TRAIN] epoch: 1, iter: 2640/20000, loss: 0.4878, lr: 0.008804, batch_cost: 0.2213, reader_cost: 0.09903, ips: 18.0736 samples/sec | ETA 01:04:02
2022-07-21 15:06:59 [INFO]	[TRAIN] epoch: 1, iter: 2650/20000, loss: 0.5027, lr: 0.008800, batch_cost: 0.2269, reader_cost: 0.09784, ips: 17.6294 samples/sec | ETA 01:05:36
2022-07-21 15:07:03 [INFO]	[TRAIN] epoch: 1, iter: 2660/20000, loss: 0.3972, lr: 0.008795, batch_cost: 0.4306, reader_cost: 0.22012, ips: 9.2903 samples/sec | ETA 02:04:25
2022-07-21 15:07:06 [INFO]	[TRAIN] epoch: 1, iter: 2670/20000, loss: 0.3846, lr: 0.008791, batch_cost: 0.2658, reader_cost: 0.13217, ips: 15.0480 samples/sec | ETA 01:16:46
2022-07-21 15:07:08 [INFO]	[TRAIN] epoch: 1, iter: 2680/20000, loss: 0.5337, lr: 0.008786, batch_cost: 0.2248, reader_cost: 0.10269, ips: 17.7968 samples/sec | ETA 01:04:52
2022-07-21 15:07:10 [INFO]	[TRAIN] epoch: 1, iter: 2690/20000, loss: 0.3792, lr: 0.008781, batch_cost: 0.2412, reader_cost: 0.11824, ips: 16.5847 samples/sec | ETA 01:09:34
2022-07-21 15:07:13 [INFO]	[TRAIN] epoch: 1, iter: 2700/20000, loss: 0.4578, lr: 0.008777, batch_cost: 0.2364, reader_cost: 0.10836, ips: 16.9208 samples/sec | ETA 01:08:09
2022-07-21 15:07:15 [INFO]	[TRAIN] epoch: 1, iter: 2710/20000, loss: 0.3737, lr: 0.008772, batch_cost: 0.2423, reader_cost: 0.11718, ips: 16.5116 samples/sec | ETA 01:09:48
2022-07-21 15:07:18 [INFO]	[TRAIN] epoch: 1, iter: 2720/20000, loss: 0.4554, lr: 0.008768, batch_cost: 0.2533, reader_cost: 0.12977, ips: 15.7933 samples/sec | ETA 01:12:56
2022-07-21 15:07:20 [INFO]	[TRAIN] epoch: 1, iter: 2730/20000, loss: 0.4278, lr: 0.008763, batch_cost: 0.2412, reader_cost: 0.12108, ips: 16.5827 samples/sec | ETA 01:09:25
2022-07-21 15:07:23 [INFO]	[TRAIN] epoch: 1, iter: 2740/20000, loss: 0.4343, lr: 0.008759, batch_cost: 0.2429, reader_cost: 0.11634, ips: 16.4695 samples/sec | ETA 01:09:52
2022-07-21 15:07:25 [INFO]	[TRAIN] epoch: 1, iter: 2750/20000, loss: 0.4592, lr: 0.008754, batch_cost: 0.2404, reader_cost: 0.11674, ips: 16.6374 samples/sec | ETA 01:09:07
2022-07-21 15:07:27 [INFO]	[TRAIN] epoch: 1, iter: 2760/20000, loss: 0.4283, lr: 0.008749, batch_cost: 0.2322, reader_cost: 0.10676, ips: 17.2263 samples/sec | ETA 01:06:43
2022-07-21 15:07:30 [INFO]	[TRAIN] epoch: 1, iter: 2770/20000, loss: 0.3794, lr: 0.008745, batch_cost: 0.2285, reader_cost: 0.10557, ips: 17.5034 samples/sec | ETA 01:05:37
2022-07-21 15:07:32 [INFO]	[TRAIN] epoch: 1, iter: 2780/20000, loss: 0.3483, lr: 0.008740, batch_cost: 0.2592, reader_cost: 0.11846, ips: 15.4315 samples/sec | ETA 01:14:23
2022-07-21 15:07:35 [INFO]	[TRAIN] epoch: 1, iter: 2790/20000, loss: 0.3371, lr: 0.008736, batch_cost: 0.2517, reader_cost: 0.12049, ips: 15.8949 samples/sec | ETA 01:12:10
2022-07-21 15:07:38 [INFO]	[TRAIN] epoch: 1, iter: 2800/20000, loss: 0.3481, lr: 0.008731, batch_cost: 0.2841, reader_cost: 0.13740, ips: 14.0806 samples/sec | ETA 01:21:26
2022-07-21 15:07:40 [INFO]	[TRAIN] epoch: 1, iter: 2810/20000, loss: 0.6683, lr: 0.008727, batch_cost: 0.2555, reader_cost: 0.12957, ips: 15.6568 samples/sec | ETA 01:13:11
2022-07-21 15:07:43 [INFO]	[TRAIN] epoch: 1, iter: 2820/20000, loss: 0.2658, lr: 0.008722, batch_cost: 0.2517, reader_cost: 0.12430, ips: 15.8950 samples/sec | ETA 01:12:03
2022-07-21 15:07:45 [INFO]	[TRAIN] epoch: 1, iter: 2830/20000, loss: 0.3860, lr: 0.008717, batch_cost: 0.2466, reader_cost: 0.12234, ips: 16.2178 samples/sec | ETA 01:10:34
2022-07-21 15:07:48 [INFO]	[TRAIN] epoch: 1, iter: 2840/20000, loss: 0.3310, lr: 0.008713, batch_cost: 0.2559, reader_cost: 0.12597, ips: 15.6329 samples/sec | ETA 01:13:10
2022-07-21 15:07:50 [INFO]	[TRAIN] epoch: 1, iter: 2850/20000, loss: 0.4162, lr: 0.008708, batch_cost: 0.2468, reader_cost: 0.11942, ips: 16.2069 samples/sec | ETA 01:10:32
2022-07-21 15:07:52 [INFO]	[TRAIN] epoch: 1, iter: 2860/20000, loss: 0.5188, lr: 0.008704, batch_cost: 0.2288, reader_cost: 0.10441, ips: 17.4857 samples/sec | ETA 01:05:20
2022-07-21 15:07:55 [INFO]	[TRAIN] epoch: 1, iter: 2870/20000, loss: 0.5634, lr: 0.008699, batch_cost: 0.2387, reader_cost: 0.11674, ips: 16.7551 samples/sec | ETA 01:08:09
2022-07-21 15:07:57 [INFO]	[TRAIN] epoch: 1, iter: 2880/20000, loss: 0.3509, lr: 0.008695, batch_cost: 0.2438, reader_cost: 0.11976, ips: 16.4092 samples/sec | ETA 01:09:33
2022-07-21 15:08:00 [INFO]	[TRAIN] epoch: 1, iter: 2890/20000, loss: 0.3994, lr: 0.008690, batch_cost: 0.2492, reader_cost: 0.12511, ips: 16.0540 samples/sec | ETA 01:11:03
2022-07-21 15:08:02 [INFO]	[TRAIN] epoch: 1, iter: 2900/20000, loss: 0.3933, lr: 0.008685, batch_cost: 0.2505, reader_cost: 0.12836, ips: 15.9704 samples/sec | ETA 01:11:22
2022-07-21 15:08:05 [INFO]	[TRAIN] epoch: 1, iter: 2910/20000, loss: 0.3235, lr: 0.008681, batch_cost: 0.2913, reader_cost: 0.14790, ips: 13.7321 samples/sec | ETA 01:22:58
2022-07-21 15:08:08 [INFO]	[TRAIN] epoch: 1, iter: 2920/20000, loss: 0.3627, lr: 0.008676, batch_cost: 0.3357, reader_cost: 0.17281, ips: 11.9137 samples/sec | ETA 01:35:34
2022-07-21 15:08:12 [INFO]	[TRAIN] epoch: 1, iter: 2930/20000, loss: 0.4665, lr: 0.008672, batch_cost: 0.3085, reader_cost: 0.16018, ips: 12.9654 samples/sec | ETA 01:27:46
2022-07-21 15:08:14 [INFO]	[TRAIN] epoch: 1, iter: 2940/20000, loss: 0.3812, lr: 0.008667, batch_cost: 0.2514, reader_cost: 0.12692, ips: 15.9119 samples/sec | ETA 01:11:28
2022-07-21 15:08:16 [INFO]	[TRAIN] epoch: 1, iter: 2950/20000, loss: 0.5189, lr: 0.008663, batch_cost: 0.2301, reader_cost: 0.10745, ips: 17.3869 samples/sec | ETA 01:05:22
2022-07-21 15:08:19 [INFO]	[TRAIN] epoch: 1, iter: 2960/20000, loss: 0.2923, lr: 0.008658, batch_cost: 0.2290, reader_cost: 0.10596, ips: 17.4666 samples/sec | ETA 01:05:02
2022-07-21 15:08:21 [INFO]	[TRAIN] epoch: 1, iter: 2970/20000, loss: 0.4297, lr: 0.008653, batch_cost: 0.2381, reader_cost: 0.11619, ips: 16.7995 samples/sec | ETA 01:07:34
2022-07-21 15:08:23 [INFO]	[TRAIN] epoch: 1, iter: 2980/20000, loss: 0.4020, lr: 0.008649, batch_cost: 0.2364, reader_cost: 0.11324, ips: 16.9234 samples/sec | ETA 01:07:02
2022-07-21 15:08:26 [INFO]	[TRAIN] epoch: 1, iter: 2990/20000, loss: 0.4838, lr: 0.008644, batch_cost: 0.2356, reader_cost: 0.11406, ips: 16.9800 samples/sec | ETA 01:06:47
2022-07-21 15:08:28 [INFO]	[TRAIN] epoch: 1, iter: 3000/20000, loss: 0.4002, lr: 0.008640, batch_cost: 0.2298, reader_cost: 0.10783, ips: 17.4047 samples/sec | ETA 01:05:07
2022-07-21 15:08:31 [INFO]	[TRAIN] epoch: 1, iter: 3010/20000, loss: 0.3220, lr: 0.008635, batch_cost: 0.2573, reader_cost: 0.13225, ips: 15.5464 samples/sec | ETA 01:12:51
2022-07-21 15:08:33 [INFO]	[TRAIN] epoch: 1, iter: 3020/20000, loss: 0.4214, lr: 0.008631, batch_cost: 0.2399, reader_cost: 0.11714, ips: 16.6726 samples/sec | ETA 01:07:53
2022-07-21 15:08:35 [INFO]	[TRAIN] epoch: 1, iter: 3030/20000, loss: 0.4681, lr: 0.008626, batch_cost: 0.2418, reader_cost: 0.11388, ips: 16.5410 samples/sec | ETA 01:08:23
2022-07-21 15:08:38 [INFO]	[TRAIN] epoch: 1, iter: 3040/20000, loss: 0.4758, lr: 0.008621, batch_cost: 0.3001, reader_cost: 0.13911, ips: 13.3291 samples/sec | ETA 01:24:49
2022-07-21 15:08:41 [INFO]	[TRAIN] epoch: 1, iter: 3050/20000, loss: 0.3360, lr: 0.008617, batch_cost: 0.2690, reader_cost: 0.13264, ips: 14.8697 samples/sec | ETA 01:15:59
2022-07-21 15:08:44 [INFO]	[TRAIN] epoch: 1, iter: 3060/20000, loss: 0.4790, lr: 0.008612, batch_cost: 0.2590, reader_cost: 0.09907, ips: 15.4436 samples/sec | ETA 01:13:07
2022-07-21 15:08:46 [INFO]	[TRAIN] epoch: 1, iter: 3070/20000, loss: 0.5001, lr: 0.008608, batch_cost: 0.2672, reader_cost: 0.13652, ips: 14.9673 samples/sec | ETA 01:15:24
2022-07-21 15:08:49 [INFO]	[TRAIN] epoch: 1, iter: 3080/20000, loss: 0.4304, lr: 0.008603, batch_cost: 0.2563, reader_cost: 0.12152, ips: 15.6095 samples/sec | ETA 01:12:15
2022-07-21 15:08:51 [INFO]	[TRAIN] epoch: 1, iter: 3090/20000, loss: 0.3764, lr: 0.008599, batch_cost: 0.2280, reader_cost: 0.09550, ips: 17.5428 samples/sec | ETA 01:04:15
2022-07-21 15:08:54 [INFO]	[TRAIN] epoch: 1, iter: 3100/20000, loss: 0.4385, lr: 0.008594, batch_cost: 0.2351, reader_cost: 0.10701, ips: 17.0156 samples/sec | ETA 01:06:12
2022-07-21 15:08:56 [INFO]	[TRAIN] epoch: 1, iter: 3110/20000, loss: 0.4904, lr: 0.008589, batch_cost: 0.2319, reader_cost: 0.10158, ips: 17.2489 samples/sec | ETA 01:05:16
2022-07-21 15:08:58 [INFO]	[TRAIN] epoch: 1, iter: 3120/20000, loss: 0.4073, lr: 0.008585, batch_cost: 0.2333, reader_cost: 0.10906, ips: 17.1489 samples/sec | ETA 01:05:37
2022-07-21 15:09:01 [INFO]	[TRAIN] epoch: 1, iter: 3130/20000, loss: 0.5778, lr: 0.008580, batch_cost: 0.2443, reader_cost: 0.11885, ips: 16.3709 samples/sec | ETA 01:08:41
2022-07-21 15:09:03 [INFO]	[TRAIN] epoch: 1, iter: 3140/20000, loss: 0.3426, lr: 0.008576, batch_cost: 0.2364, reader_cost: 0.11508, ips: 16.9217 samples/sec | ETA 01:06:25
2022-07-21 15:09:06 [INFO]	[TRAIN] epoch: 1, iter: 3150/20000, loss: 0.4100, lr: 0.008571, batch_cost: 0.2420, reader_cost: 0.11801, ips: 16.5299 samples/sec | ETA 01:07:57
2022-07-21 15:09:08 [INFO]	[TRAIN] epoch: 1, iter: 3160/20000, loss: 0.5320, lr: 0.008567, batch_cost: 0.2782, reader_cost: 0.14625, ips: 14.3782 samples/sec | ETA 01:18:04
2022-07-21 15:09:11 [INFO]	[TRAIN] epoch: 1, iter: 3170/20000, loss: 0.4115, lr: 0.008562, batch_cost: 0.2430, reader_cost: 0.11660, ips: 16.4625 samples/sec | ETA 01:08:09
2022-07-21 15:09:13 [INFO]	[TRAIN] epoch: 1, iter: 3180/20000, loss: 0.3951, lr: 0.008557, batch_cost: 0.2533, reader_cost: 0.11296, ips: 15.7895 samples/sec | ETA 01:11:01
2022-07-21 15:09:17 [INFO]	[TRAIN] epoch: 1, iter: 3190/20000, loss: 0.5235, lr: 0.008553, batch_cost: 0.3509, reader_cost: 0.17824, ips: 11.4000 samples/sec | ETA 01:38:18
2022-07-21 15:09:19 [INFO]	[TRAIN] epoch: 1, iter: 3200/20000, loss: 0.3579, lr: 0.008548, batch_cost: 0.2595, reader_cost: 0.13354, ips: 15.4148 samples/sec | ETA 01:12:39
2022-07-21 15:09:22 [INFO]	[TRAIN] epoch: 1, iter: 3210/20000, loss: 0.4029, lr: 0.008544, batch_cost: 0.2347, reader_cost: 0.10922, ips: 17.0406 samples/sec | ETA 01:05:41
2022-07-21 15:09:24 [INFO]	[TRAIN] epoch: 1, iter: 3220/20000, loss: 0.3379, lr: 0.008539, batch_cost: 0.2402, reader_cost: 0.11842, ips: 16.6521 samples/sec | ETA 01:07:10
2022-07-21 15:09:27 [INFO]	[TRAIN] epoch: 1, iter: 3230/20000, loss: 0.3596, lr: 0.008534, batch_cost: 0.2484, reader_cost: 0.12530, ips: 16.1001 samples/sec | ETA 01:09:26
2022-07-21 15:09:29 [INFO]	[TRAIN] epoch: 1, iter: 3240/20000, loss: 0.3281, lr: 0.008530, batch_cost: 0.2375, reader_cost: 0.11461, ips: 16.8400 samples/sec | ETA 01:06:20
2022-07-21 15:09:31 [INFO]	[TRAIN] epoch: 1, iter: 3250/20000, loss: 0.4098, lr: 0.008525, batch_cost: 0.2428, reader_cost: 0.12111, ips: 16.4771 samples/sec | ETA 01:07:46
2022-07-21 15:09:34 [INFO]	[TRAIN] epoch: 1, iter: 3260/20000, loss: 0.5891, lr: 0.008521, batch_cost: 0.2124, reader_cost: 0.09167, ips: 18.8334 samples/sec | ETA 00:59:15
2022-07-21 15:09:36 [INFO]	[TRAIN] epoch: 1, iter: 3270/20000, loss: 0.3651, lr: 0.008516, batch_cost: 0.2390, reader_cost: 0.11789, ips: 16.7350 samples/sec | ETA 01:06:38
2022-07-21 15:09:38 [INFO]	[TRAIN] epoch: 1, iter: 3280/20000, loss: 0.5262, lr: 0.008512, batch_cost: 0.2387, reader_cost: 0.11501, ips: 16.7596 samples/sec | ETA 01:06:30
2022-07-21 15:09:41 [INFO]	[TRAIN] epoch: 1, iter: 3290/20000, loss: 0.4970, lr: 0.008507, batch_cost: 0.2960, reader_cost: 0.13647, ips: 13.5137 samples/sec | ETA 01:22:26
2022-07-21 15:09:44 [INFO]	[TRAIN] epoch: 1, iter: 3300/20000, loss: 0.3668, lr: 0.008502, batch_cost: 0.2492, reader_cost: 0.11549, ips: 16.0526 samples/sec | ETA 01:09:21
2022-07-21 15:09:46 [INFO]	[TRAIN] epoch: 1, iter: 3310/20000, loss: 0.3535, lr: 0.008498, batch_cost: 0.2451, reader_cost: 0.11636, ips: 16.3201 samples/sec | ETA 01:08:10
2022-07-21 15:09:49 [INFO]	[TRAIN] epoch: 1, iter: 3320/20000, loss: 0.3139, lr: 0.008493, batch_cost: 0.2465, reader_cost: 0.11889, ips: 16.2254 samples/sec | ETA 01:08:32
2022-07-21 15:09:51 [INFO]	[TRAIN] epoch: 1, iter: 3330/20000, loss: 0.4525, lr: 0.008489, batch_cost: 0.2469, reader_cost: 0.11901, ips: 16.1990 samples/sec | ETA 01:08:36
2022-07-21 15:09:54 [INFO]	[TRAIN] epoch: 1, iter: 3340/20000, loss: 0.3956, lr: 0.008484, batch_cost: 0.2545, reader_cost: 0.11526, ips: 15.7196 samples/sec | ETA 01:10:39
2022-07-21 15:09:56 [INFO]	[TRAIN] epoch: 1, iter: 3350/20000, loss: 0.3553, lr: 0.008479, batch_cost: 0.2430, reader_cost: 0.11879, ips: 16.4589 samples/sec | ETA 01:07:26
2022-07-21 15:09:59 [INFO]	[TRAIN] epoch: 1, iter: 3360/20000, loss: 0.2864, lr: 0.008475, batch_cost: 0.2427, reader_cost: 0.11591, ips: 16.4789 samples/sec | ETA 01:07:19
2022-07-21 15:10:01 [INFO]	[TRAIN] epoch: 1, iter: 3370/20000, loss: 0.3513, lr: 0.008470, batch_cost: 0.2428, reader_cost: 0.12043, ips: 16.4723 samples/sec | ETA 01:07:18
2022-07-21 15:10:03 [INFO]	[TRAIN] epoch: 1, iter: 3380/20000, loss: 0.5496, lr: 0.008466, batch_cost: 0.2468, reader_cost: 0.12361, ips: 16.2046 samples/sec | ETA 01:08:22
2022-07-21 15:10:06 [INFO]	[TRAIN] epoch: 1, iter: 3390/20000, loss: 0.4567, lr: 0.008461, batch_cost: 0.2314, reader_cost: 0.10747, ips: 17.2866 samples/sec | ETA 01:04:03
2022-07-21 15:10:08 [INFO]	[TRAIN] epoch: 1, iter: 3400/20000, loss: 0.3446, lr: 0.008457, batch_cost: 0.2300, reader_cost: 0.10583, ips: 17.3882 samples/sec | ETA 01:03:38
2022-07-21 15:10:11 [INFO]	[TRAIN] epoch: 1, iter: 3410/20000, loss: 0.4683, lr: 0.008452, batch_cost: 0.2498, reader_cost: 0.12146, ips: 16.0114 samples/sec | ETA 01:09:04
2022-07-21 15:10:13 [INFO]	[TRAIN] epoch: 1, iter: 3420/20000, loss: 0.3715, lr: 0.008447, batch_cost: 0.2760, reader_cost: 0.14626, ips: 14.4949 samples/sec | ETA 01:16:15
2022-07-21 15:10:16 [INFO]	[TRAIN] epoch: 1, iter: 3430/20000, loss: 0.2925, lr: 0.008443, batch_cost: 0.2426, reader_cost: 0.11785, ips: 16.4884 samples/sec | ETA 01:06:59
2022-07-21 15:10:18 [INFO]	[TRAIN] epoch: 1, iter: 3440/20000, loss: 0.3652, lr: 0.008438, batch_cost: 0.2420, reader_cost: 0.11866, ips: 16.5270 samples/sec | ETA 01:06:47
2022-07-21 15:10:21 [INFO]	[TRAIN] epoch: 1, iter: 3450/20000, loss: 0.4360, lr: 0.008434, batch_cost: 0.3193, reader_cost: 0.13887, ips: 12.5285 samples/sec | ETA 01:28:03
2022-07-21 15:10:25 [INFO]	[TRAIN] epoch: 1, iter: 3460/20000, loss: 0.3057, lr: 0.008429, batch_cost: 0.3115, reader_cost: 0.16518, ips: 12.8408 samples/sec | ETA 01:25:52
2022-07-21 15:10:27 [INFO]	[TRAIN] epoch: 1, iter: 3470/20000, loss: 0.3782, lr: 0.008424, batch_cost: 0.2622, reader_cost: 0.13734, ips: 15.2554 samples/sec | ETA 01:12:14
2022-07-21 15:10:29 [INFO]	[TRAIN] epoch: 1, iter: 3480/20000, loss: 0.4177, lr: 0.008420, batch_cost: 0.2305, reader_cost: 0.10780, ips: 17.3554 samples/sec | ETA 01:03:27
2022-07-21 15:10:32 [INFO]	[TRAIN] epoch: 1, iter: 3490/20000, loss: 0.4046, lr: 0.008415, batch_cost: 0.2430, reader_cost: 0.11984, ips: 16.4613 samples/sec | ETA 01:06:51
2022-07-21 15:10:34 [INFO]	[TRAIN] epoch: 1, iter: 3500/20000, loss: 0.5678, lr: 0.008411, batch_cost: 0.2346, reader_cost: 0.11176, ips: 17.0526 samples/sec | ETA 01:04:30
2022-07-21 15:10:37 [INFO]	[TRAIN] epoch: 1, iter: 3510/20000, loss: 0.3199, lr: 0.008406, batch_cost: 0.2688, reader_cost: 0.14579, ips: 14.8797 samples/sec | ETA 01:13:52
2022-07-21 15:10:39 [INFO]	[TRAIN] epoch: 1, iter: 3520/20000, loss: 0.4008, lr: 0.008402, batch_cost: 0.2495, reader_cost: 0.12521, ips: 16.0340 samples/sec | ETA 01:08:31
2022-07-21 15:10:42 [INFO]	[TRAIN] epoch: 1, iter: 3530/20000, loss: 0.3773, lr: 0.008397, batch_cost: 0.2504, reader_cost: 0.12554, ips: 15.9746 samples/sec | ETA 01:08:44
2022-07-21 15:10:45 [INFO]	[TRAIN] epoch: 1, iter: 3540/20000, loss: 0.3974, lr: 0.008392, batch_cost: 0.3120, reader_cost: 0.15467, ips: 12.8215 samples/sec | ETA 01:25:35
2022-07-21 15:10:48 [INFO]	[TRAIN] epoch: 1, iter: 3550/20000, loss: 0.5607, lr: 0.008388, batch_cost: 0.2503, reader_cost: 0.11908, ips: 15.9779 samples/sec | ETA 01:08:38
2022-07-21 15:10:50 [INFO]	[TRAIN] epoch: 1, iter: 3560/20000, loss: 0.4736, lr: 0.008383, batch_cost: 0.2383, reader_cost: 0.11640, ips: 16.7887 samples/sec | ETA 01:05:16
2022-07-21 15:10:52 [INFO]	[TRAIN] epoch: 1, iter: 3570/20000, loss: 0.4516, lr: 0.008379, batch_cost: 0.2380, reader_cost: 0.10482, ips: 16.8095 samples/sec | ETA 01:05:09
2022-07-21 15:10:55 [INFO]	[TRAIN] epoch: 1, iter: 3580/20000, loss: 0.4875, lr: 0.008374, batch_cost: 0.2502, reader_cost: 0.10365, ips: 15.9847 samples/sec | ETA 01:08:28
2022-07-21 15:10:57 [INFO]	[TRAIN] epoch: 1, iter: 3590/20000, loss: 0.3739, lr: 0.008369, batch_cost: 0.2681, reader_cost: 0.13759, ips: 14.9211 samples/sec | ETA 01:13:19
2022-07-21 15:11:00 [INFO]	[TRAIN] epoch: 1, iter: 3600/20000, loss: 0.3562, lr: 0.008365, batch_cost: 0.2696, reader_cost: 0.13799, ips: 14.8369 samples/sec | ETA 01:13:41
2022-07-21 15:11:03 [INFO]	[TRAIN] epoch: 1, iter: 3610/20000, loss: 0.3799, lr: 0.008360, batch_cost: 0.2501, reader_cost: 0.12651, ips: 15.9953 samples/sec | ETA 01:08:18
2022-07-21 15:11:05 [INFO]	[TRAIN] epoch: 1, iter: 3620/20000, loss: 0.3751, lr: 0.008356, batch_cost: 0.2542, reader_cost: 0.13135, ips: 15.7363 samples/sec | ETA 01:09:23
2022-07-21 15:11:08 [INFO]	[TRAIN] epoch: 1, iter: 3630/20000, loss: 0.3792, lr: 0.008351, batch_cost: 0.2369, reader_cost: 0.11338, ips: 16.8852 samples/sec | ETA 01:04:37
2022-07-21 15:11:10 [INFO]	[TRAIN] epoch: 1, iter: 3640/20000, loss: 0.5071, lr: 0.008346, batch_cost: 0.2429, reader_cost: 0.12067, ips: 16.4672 samples/sec | ETA 01:06:13
2022-07-21 15:11:12 [INFO]	[TRAIN] epoch: 1, iter: 3650/20000, loss: 0.4547, lr: 0.008342, batch_cost: 0.2361, reader_cost: 0.11423, ips: 16.9431 samples/sec | ETA 01:04:19
2022-07-21 15:11:15 [INFO]	[TRAIN] epoch: 1, iter: 3660/20000, loss: 0.3599, lr: 0.008337, batch_cost: 0.2553, reader_cost: 0.11689, ips: 15.6654 samples/sec | ETA 01:09:32
2022-07-21 15:11:17 [INFO]	[TRAIN] epoch: 1, iter: 3670/20000, loss: 0.3250, lr: 0.008333, batch_cost: 0.2538, reader_cost: 0.12995, ips: 15.7627 samples/sec | ETA 01:09:03
2022-07-21 15:11:20 [INFO]	[TRAIN] epoch: 1, iter: 3680/20000, loss: 0.3162, lr: 0.008328, batch_cost: 0.2244, reader_cost: 0.10295, ips: 17.8235 samples/sec | ETA 01:01:02
2022-07-21 15:11:22 [INFO]	[TRAIN] epoch: 1, iter: 3690/20000, loss: 0.5091, lr: 0.008323, batch_cost: 0.2178, reader_cost: 0.09620, ips: 18.3664 samples/sec | ETA 00:59:12
2022-07-21 15:11:24 [INFO]	[TRAIN] epoch: 1, iter: 3700/20000, loss: 0.4181, lr: 0.008319, batch_cost: 0.2374, reader_cost: 0.11249, ips: 16.8500 samples/sec | ETA 01:04:29
2022-07-21 15:11:27 [INFO]	[TRAIN] epoch: 1, iter: 3710/20000, loss: 0.4653, lr: 0.008314, batch_cost: 0.3121, reader_cost: 0.15375, ips: 12.8181 samples/sec | ETA 01:24:43
2022-07-21 15:11:31 [INFO]	[TRAIN] epoch: 1, iter: 3720/20000, loss: 0.4560, lr: 0.008310, batch_cost: 0.3251, reader_cost: 0.15987, ips: 12.3042 samples/sec | ETA 01:28:12
2022-07-21 15:11:33 [INFO]	[TRAIN] epoch: 1, iter: 3730/20000, loss: 0.5724, lr: 0.008305, batch_cost: 0.2188, reader_cost: 0.09669, ips: 18.2796 samples/sec | ETA 00:59:20
2022-07-21 15:11:35 [INFO]	[TRAIN] epoch: 1, iter: 3740/20000, loss: 0.3923, lr: 0.008301, batch_cost: 0.2312, reader_cost: 0.10808, ips: 17.2975 samples/sec | ETA 01:02:40
2022-07-21 15:11:38 [INFO]	[TRAIN] epoch: 1, iter: 3750/20000, loss: 0.5189, lr: 0.008296, batch_cost: 0.2368, reader_cost: 0.11319, ips: 16.8946 samples/sec | ETA 01:04:07
2022-07-21 15:11:40 [INFO]	[TRAIN] epoch: 1, iter: 3760/20000, loss: 0.3511, lr: 0.008291, batch_cost: 0.2488, reader_cost: 0.12282, ips: 16.0798 samples/sec | ETA 01:07:19
2022-07-21 15:11:43 [INFO]	[TRAIN] epoch: 1, iter: 3770/20000, loss: 0.3467, lr: 0.008287, batch_cost: 0.2530, reader_cost: 0.12898, ips: 15.8103 samples/sec | ETA 01:08:26
2022-07-21 15:11:45 [INFO]	[TRAIN] epoch: 1, iter: 3780/20000, loss: 0.5450, lr: 0.008282, batch_cost: 0.2489, reader_cost: 0.12431, ips: 16.0705 samples/sec | ETA 01:07:17
2022-07-21 15:11:48 [INFO]	[TRAIN] epoch: 1, iter: 3790/20000, loss: 0.3520, lr: 0.008278, batch_cost: 0.3173, reader_cost: 0.17016, ips: 12.6081 samples/sec | ETA 01:25:42
2022-07-21 15:11:51 [INFO]	[TRAIN] epoch: 1, iter: 3800/20000, loss: 0.4174, lr: 0.008273, batch_cost: 0.2581, reader_cost: 0.12382, ips: 15.4997 samples/sec | ETA 01:09:40
2022-07-21 15:11:53 [INFO]	[TRAIN] epoch: 1, iter: 3810/20000, loss: 0.3643, lr: 0.008268, batch_cost: 0.2344, reader_cost: 0.11195, ips: 17.0658 samples/sec | ETA 01:03:14
2022-07-21 15:11:55 [INFO]	[TRAIN] epoch: 1, iter: 3820/20000, loss: 0.4225, lr: 0.008264, batch_cost: 0.2220, reader_cost: 0.09959, ips: 18.0177 samples/sec | ETA 00:59:52
2022-07-21 15:11:58 [INFO]	[TRAIN] epoch: 1, iter: 3830/20000, loss: 0.4806, lr: 0.008259, batch_cost: 0.2449, reader_cost: 0.12058, ips: 16.3342 samples/sec | ETA 01:05:59
2022-07-21 15:12:00 [INFO]	[TRAIN] epoch: 1, iter: 3840/20000, loss: 0.4841, lr: 0.008255, batch_cost: 0.2516, reader_cost: 0.12001, ips: 15.8976 samples/sec | ETA 01:07:46
2022-07-21 15:12:03 [INFO]	[TRAIN] epoch: 1, iter: 3850/20000, loss: 0.3473, lr: 0.008250, batch_cost: 0.2560, reader_cost: 0.12997, ips: 15.6278 samples/sec | ETA 01:08:53
2022-07-21 15:12:06 [INFO]	[TRAIN] epoch: 1, iter: 3860/20000, loss: 0.3111, lr: 0.008245, batch_cost: 0.2639, reader_cost: 0.13834, ips: 15.1549 samples/sec | ETA 01:11:00
2022-07-21 15:12:08 [INFO]	[TRAIN] epoch: 1, iter: 3870/20000, loss: 0.4574, lr: 0.008241, batch_cost: 0.2412, reader_cost: 0.11422, ips: 16.5823 samples/sec | ETA 01:04:50
2022-07-21 15:12:10 [INFO]	[TRAIN] epoch: 1, iter: 3880/20000, loss: 0.3299, lr: 0.008236, batch_cost: 0.2274, reader_cost: 0.10385, ips: 17.5868 samples/sec | ETA 01:01:06
2022-07-21 15:12:13 [INFO]	[TRAIN] epoch: 1, iter: 3890/20000, loss: 0.3384, lr: 0.008232, batch_cost: 0.2360, reader_cost: 0.11281, ips: 16.9478 samples/sec | ETA 01:03:22
2022-07-21 15:12:15 [INFO]	[TRAIN] epoch: 1, iter: 3900/20000, loss: 0.4258, lr: 0.008227, batch_cost: 0.2426, reader_cost: 0.11998, ips: 16.4885 samples/sec | ETA 01:05:05
2022-07-21 15:12:17 [INFO]	[TRAIN] epoch: 1, iter: 3910/20000, loss: 0.3746, lr: 0.008222, batch_cost: 0.2416, reader_cost: 0.11829, ips: 16.5560 samples/sec | ETA 01:04:47
2022-07-21 15:12:20 [INFO]	[TRAIN] epoch: 1, iter: 3920/20000, loss: 0.5183, lr: 0.008218, batch_cost: 0.2990, reader_cost: 0.15620, ips: 13.3796 samples/sec | ETA 01:20:07
2022-07-21 15:12:23 [INFO]	[TRAIN] epoch: 1, iter: 3930/20000, loss: 0.4921, lr: 0.008213, batch_cost: 0.2354, reader_cost: 0.10985, ips: 16.9924 samples/sec | ETA 01:03:02
2022-07-21 15:12:25 [INFO]	[TRAIN] epoch: 1, iter: 3940/20000, loss: 0.2848, lr: 0.008209, batch_cost: 0.2443, reader_cost: 0.11316, ips: 16.3739 samples/sec | ETA 01:05:23
2022-07-21 15:12:28 [INFO]	[TRAIN] epoch: 1, iter: 3950/20000, loss: 0.3905, lr: 0.008204, batch_cost: 0.2505, reader_cost: 0.12296, ips: 15.9700 samples/sec | ETA 01:07:00
2022-07-21 15:12:30 [INFO]	[TRAIN] epoch: 1, iter: 3960/20000, loss: 0.5582, lr: 0.008199, batch_cost: 0.2369, reader_cost: 0.11257, ips: 16.8871 samples/sec | ETA 01:03:19
2022-07-21 15:12:33 [INFO]	[TRAIN] epoch: 1, iter: 3970/20000, loss: 0.3308, lr: 0.008195, batch_cost: 0.2711, reader_cost: 0.14767, ips: 14.7564 samples/sec | ETA 01:12:25
2022-07-21 15:12:36 [INFO]	[TRAIN] epoch: 1, iter: 3980/20000, loss: 0.3520, lr: 0.008190, batch_cost: 0.3625, reader_cost: 0.18163, ips: 11.0350 samples/sec | ETA 01:36:46
2022-07-21 15:12:39 [INFO]	[TRAIN] epoch: 1, iter: 3990/20000, loss: 0.4225, lr: 0.008186, batch_cost: 0.2915, reader_cost: 0.15763, ips: 13.7239 samples/sec | ETA 01:17:46
2022-07-21 15:12:42 [INFO]	[TRAIN] epoch: 1, iter: 4000/20000, loss: 0.4342, lr: 0.008181, batch_cost: 0.2236, reader_cost: 0.09971, ips: 17.8880 samples/sec | ETA 00:59:37
2022-07-21 15:12:42 [INFO]	Start evaluating (total_samples: 7361, total_iters: 7361)...
7361/7361 [==============================] - 456s 62ms/step - batch_cost: 0.0617 - reader cost: 0.0083: 5:19 - batch_cost: 0.0579 - rea - ETA: 5:11 - - ETA: 5:08 - batch_cos - ETA: 5:07 - batch_cost: 0.0586 - ETA: 5:08 - batch_cost: 0.0591 - reader   - ETA: 5:06 - batch_cost: 0. - ETA: 5:06 - batch_cost: 0.0598 - read - ETA: 5:06 - batch - ET - ETA: 4:49 - batch_cost: 0.0606 - reader co - ETA: 4:49 - batch_cost: 0.0606 - reader cost: 0.0 - ETA - ETA: - ETA: 4:26 - batch_cost: 0 - ETA: 4:24 - batch_co - ETA: 4:21 - batch_cost: 0.0612 - reader cost: - ETA: 4:21 - batch_cost: 0.0613 - reader  - ETA: 4:21 - batch_cost: 0.0614 - reader cost: 0.0 - ETA: 4:21 - batch_cost: - ETA: 4:21 - batch_cost: 0.0619 - reader cost: - ETA: 4:21 - batch_cost: 0.0620 - reader cost: 0.0 - ETA: 4:21 - batch_cost: 0.0620 - reader cost: 0 - ETA: 4:20 - batch_cost: 0.0620 - reader cost: 0. - ETA: 4:20 - batch - ETA: 4:19 - batch_cost: 0.0623 - reader cost: 0.0 - ETA: 4:18 - batch_cost: 0.0623 - reader - ETA: 4:13 - batch_cost: 0.0624 - reader cost: - ETA: 4:13 - ba - ETA: 4:10 - batch_cost - ETA: 4:07 - batch_cost: 0.0 - ETA: 4:00 - batch_cost: 0.0624 - reader cost: 0. - ETA: 4:00 - batch - ETA: 3:58 - batc - ETA: 3:55 - batch_cost: 0.0625  - ETA: 3:54 - batch_cost: 0.0626 - - ETA: 3:53 - batch_cost: 0.0626 - reader cost: 0.010 - ETA: 3:53 - batch_cost: 0.0626 - reader co - ETA: 3 - ETA: 3:39 - batch_cost: 0.0627 - reader cost: 0. - ETA: 3:39 - batch_cost: 0.0627 -  - ETA: 3:37 - batch_cost:  - ETA: 3:30 - batch_cost: 0.0627 - rea - ETA: 3:29 - batch_cost: 0.0627 - reader cost: 0.0 - ETA: 3:28 - batch_cost: 0.0627 - reader cos - ETA: 3:28 - batch_cost: 0.0627 - reader co - ETA: 3:26 - - ETA: 3:24 - batch_cost: 0.0628 - reader co - ETA: 3:23 - batch_cost: 0.0628 - reader co - ETA: 3:23 - batch_cost: - ETA: 3:18 - ba - ETA: 3:06 - batc - ETA: 3:03 - batch_cost: 0.0635 - reader cost: 0.01 - ETA: 3:03 - batch_cost: 0.0635 - reader cos - ETA: 3:02 - batch_cost: 0.0635 - reader cost: 0.0 - ETA: 3:02 - batch_cost: 0.0635 - reader co - ETA: 3:01 - batch - ETA: 2:54 - batch_cost: 0.0636 - reader cost: 0.01 - ETA: 2:54 - batch_cost - ETA: 2:52 - batch_cost: 0.0637 - reader cost: 0. - ETA: 2:51 - batch - ETA: 2:48 - batch_cost: 0.0637 - reader cost: 0.01 - ETA: 2:48 - b - ETA:  - ETA: 2:41 - batch_cost: 0.0637 - reader cost: 0. - ET - ETA: 2:33 - batch_cost: 0.0638 - reader cost - ETA: 2:32 -  - ETA: 2:29 - batch_cost: 0.0638 -  - ETA: 2:28 - batch_ - ETA: 2:25 - batch_cost: 0.0638 - reader  - ETA: 2:24 - - ETA: 2:09 - batch_co -  - ETA: 1:59 - batch_cost: 0.0637 - reader cost: 0.0 - ETA: 1:59 - batch_cost: 0.0637 - reader cost: 0. - ETA: 1:58 - batch_cost: 0.0637 - reader cost: 0 - ETA: 1:58 - batch_cost: 0.0636 - reader cost:  - ETA: 1:57 - b - ETA: 1:53 - batch_cost: 0.0636 - reader cost: 0.0 - ETA: 1:53 - batch_cost: 0.0635 - reader cos - ETA: 1:52 - ba - ETA: 1:48 - batch_cost: 0.0635 - reader cost: 0 - ETA: 1:47 - batch_cost: - ETA: 1:45 - batch_cost: 0.063 - ETA: 1:42 - batch_cost: 0 - ETA: 1:32 - batch_cost: 0.0632 - reader cost: 0.010 - ETA: 1:32 - batch_cost: 0.0631 - reader cost: 0 - ETA: 1:32 - batch_cost: 0. - ETA: 1:22 - batch_cost: 0.0628 - reader c - ETA: 1:20 - batch_cost: 0.0 - ETA: 1:18 - batch_cost: 0.0627 - reader cost - ETA: 1:17 - batch_cost: 0.0627 - reader cost: 0.009 - ETA: 1:17 - batch_cost: 0.0627 - reader - ETA: 1:15 - batch_cost: 0.0627 - reader cost: 0.009 - ETA: 1:15 - batch_cost: 0.0627 - reader cost - ETA: 1:14 - batch_cost: 0.0626 - reader cost: 0.009 - ETA: 1:14 - batch_cost: 0.0626 - reade - ETA: - ETA: 1:08 - batch_cost: 0.0626 - read - ETA: 1:07 - batch_cost: 0.0627 - rea - ETA: 1:00 - batch_cost: 0.0629 - reader c - ETA: 59 - ETA: 56 - ETA: 54s - batch_cost: 0.06 - ETA: 52s - batch_cost: 0.0627 - - ETA: 51s - batch_cost: 0. - ETA: 50s - batch_cost - ETA: 48s - batch_cost: 0.0626 - r - ETA: 47s - batch_cost:  - ETA: 45s - batch_cost: 0.0625 - rea - ETA: 44s - batch_cost: 0.0625 - reade - ETA: 36s - batch_cost: 0.0624 - reader  - ETA: 36s - batch_cost: 0.0624 - reader cost - ETA: 36s - batch_cost: 0.0624 - reader cost: 0.00 - ETA: 35s - batch_cost: 0.0624 - reade - ETA: 35s - batch_cost: 0.0625 - reader  - ETA:  - ETA: 16s - batch_co - ETA: 14s - batch_ - ETA: 12s - batch_cost: 0.0620 - reader cost:  - ETA: 12s - batch_cost:  - ETA: 11s - batch_cost: 0.0619 - reader  - ETA: 10s - batch_cost - ETA: 8s - batch_cost: 0.0619 - reader cost: 0.00 - ETA: 7s - batch_cost: 0.0619 - read
2022-07-21 15:20:18 [INFO]	[EVAL] #Images: 7361 mIoU: 0.0894 Acc: 0.9832 Kappa: 0.2964 Dice: 0.1093
2022-07-21 15:20:18 [INFO]	[EVAL] Class IoU: 
[0.9849 0.1079 0.0229 0.008  0.     0.     0.     0.     0.1458 0.0001
 0.     0.     0.     0.     0.     0.     0.     0.     0.     0.519 ]
2022-07-21 15:20:18 [INFO]	[EVAL] Class Precision: 
[0.9857 0.3042 0.2541 0.4175 0.     0.     0.     0.     0.5409 0.0209
 0.     0.     0.     0.     0.     0.     0.     0.     0.     0.768 ]
2022-07-21 15:20:18 [INFO]	[EVAL] Class Recall: 
[0.9991 0.1433 0.0246 0.0081 0.     0.     0.     0.     0.1664 0.0001
 0.     0.     0.     0.     0.     0.     0.     0.     0.     0.6155]
2022-07-21 15:20:18 [INFO]	[EVAL] The model with the best validation mIoU (0.0894) was saved at iter 4000.
2022-07-21 15:20:20 [INFO]	[TRAIN] epoch: 1, iter: 4010/20000, loss: 0.4041, lr: 0.008176, batch_cost: 0.1634, reader_cost: 0.02467, ips: 24.4795 samples/sec | ETA 00:43:32
2022-07-21 15:20:23 [INFO]	[TRAIN] epoch: 1, iter: 4020/20000, loss: 0.4797, lr: 0.008172, batch_cost: 0.2952, reader_cost: 0.14256, ips: 13.5496 samples/sec | ETA 01:18:37
2022-07-21 15:20:26 [INFO]	[TRAIN] epoch: 1, iter: 4030/20000, loss: 0.3249, lr: 0.008167, batch_cost: 0.2906, reader_cost: 0.15223, ips: 13.7627 samples/sec | ETA 01:17:21
2022-07-21 15:20:29 [INFO]	[TRAIN] epoch: 1, iter: 4040/20000, loss: 0.4118, lr: 0.008163, batch_cost: 0.3300, reader_cost: 0.14932, ips: 12.1217 samples/sec | ETA 01:27:46
2022-07-21 15:20:31 [INFO]	[TRAIN] epoch: 1, iter: 4050/20000, loss: 0.4153, lr: 0.008158, batch_cost: 0.2376, reader_cost: 0.10992, ips: 16.8326 samples/sec | ETA 01:03:10
2022-07-21 15:20:34 [INFO]	[TRAIN] epoch: 1, iter: 4060/20000, loss: 0.4541, lr: 0.008153, batch_cost: 0.2552, reader_cost: 0.13091, ips: 15.6735 samples/sec | ETA 01:07:48
2022-07-21 15:20:36 [INFO]	[TRAIN] epoch: 1, iter: 4070/20000, loss: 0.3979, lr: 0.008149, batch_cost: 0.2278, reader_cost: 0.10450, ips: 17.5590 samples/sec | ETA 01:00:28
2022-07-21 15:20:38 [INFO]	[TRAIN] epoch: 1, iter: 4080/20000, loss: 0.4290, lr: 0.008144, batch_cost: 0.2231, reader_cost: 0.10085, ips: 17.9284 samples/sec | ETA 00:59:11
2022-07-21 15:20:41 [INFO]	[TRAIN] epoch: 1, iter: 4090/20000, loss: 0.3676, lr: 0.008140, batch_cost: 0.2288, reader_cost: 0.10687, ips: 17.4815 samples/sec | ETA 01:00:40
2022-07-21 15:20:43 [INFO]	[TRAIN] epoch: 1, iter: 4100/20000, loss: 0.4343, lr: 0.008135, batch_cost: 0.2549, reader_cost: 0.13233, ips: 15.6907 samples/sec | ETA 01:07:33
2022-07-21 15:20:46 [INFO]	[TRAIN] epoch: 1, iter: 4110/20000, loss: 0.4673, lr: 0.008130, batch_cost: 0.2613, reader_cost: 0.13317, ips: 15.3104 samples/sec | ETA 01:09:11
2022-07-21 15:20:48 [INFO]	[TRAIN] epoch: 1, iter: 4120/20000, loss: 0.3649, lr: 0.008126, batch_cost: 0.2361, reader_cost: 0.10910, ips: 16.9442 samples/sec | ETA 01:02:28
2022-07-21 15:20:51 [INFO]	[TRAIN] epoch: 1, iter: 4130/20000, loss: 0.4073, lr: 0.008121, batch_cost: 0.2643, reader_cost: 0.12948, ips: 15.1324 samples/sec | ETA 01:09:54
2022-07-21 15:20:54 [INFO]	[TRAIN] epoch: 1, iter: 4140/20000, loss: 0.3604, lr: 0.008117, batch_cost: 0.2895, reader_cost: 0.14755, ips: 13.8187 samples/sec | ETA 01:16:30
2022-07-21 15:20:56 [INFO]	[TRAIN] epoch: 1, iter: 4150/20000, loss: 0.4729, lr: 0.008112, batch_cost: 0.2504, reader_cost: 0.12316, ips: 15.9760 samples/sec | ETA 01:06:08
2022-07-21 15:20:59 [INFO]	[TRAIN] epoch: 1, iter: 4160/20000, loss: 0.3199, lr: 0.008107, batch_cost: 0.2589, reader_cost: 0.13488, ips: 15.4475 samples/sec | ETA 01:08:21
2022-07-21 15:21:01 [INFO]	[TRAIN] epoch: 1, iter: 4170/20000, loss: 0.4568, lr: 0.008103, batch_cost: 0.2305, reader_cost: 0.10588, ips: 17.3526 samples/sec | ETA 01:00:49
2022-07-21 15:21:03 [INFO]	[TRAIN] epoch: 1, iter: 4180/20000, loss: 0.3283, lr: 0.008098, batch_cost: 0.2266, reader_cost: 0.10509, ips: 17.6551 samples/sec | ETA 00:59:44
2022-07-21 15:21:06 [INFO]	[TRAIN] epoch: 1, iter: 4190/20000, loss: 0.3070, lr: 0.008094, batch_cost: 0.2449, reader_cost: 0.12017, ips: 16.3309 samples/sec | ETA 01:04:32
2022-07-21 15:21:08 [INFO]	[TRAIN] epoch: 1, iter: 4200/20000, loss: 0.3590, lr: 0.008089, batch_cost: 0.2346, reader_cost: 0.11209, ips: 17.0525 samples/sec | ETA 01:01:46
2022-07-21 15:21:11 [INFO]	[TRAIN] epoch: 1, iter: 4210/20000, loss: 0.3962, lr: 0.008084, batch_cost: 0.2517, reader_cost: 0.13106, ips: 15.8942 samples/sec | ETA 01:06:13
2022-07-21 15:21:13 [INFO]	[TRAIN] epoch: 1, iter: 4220/20000, loss: 0.3946, lr: 0.008080, batch_cost: 0.2598, reader_cost: 0.13576, ips: 15.3950 samples/sec | ETA 01:08:20
2022-07-21 15:21:16 [INFO]	[TRAIN] epoch: 1, iter: 4230/20000, loss: 0.3711, lr: 0.008075, batch_cost: 0.2598, reader_cost: 0.13564, ips: 15.3938 samples/sec | ETA 01:08:17
2022-07-21 15:21:18 [INFO]	[TRAIN] epoch: 1, iter: 4240/20000, loss: 0.4026, lr: 0.008070, batch_cost: 0.2508, reader_cost: 0.12177, ips: 15.9476 samples/sec | ETA 01:05:52
2022-07-21 15:21:21 [INFO]	[TRAIN] epoch: 1, iter: 4250/20000, loss: 0.3796, lr: 0.008066, batch_cost: 0.2605, reader_cost: 0.12724, ips: 15.3562 samples/sec | ETA 01:08:22
2022-07-21 15:21:24 [INFO]	[TRAIN] epoch: 1, iter: 4260/20000, loss: 0.4999, lr: 0.008061, batch_cost: 0.3219, reader_cost: 0.17991, ips: 12.4259 samples/sec | ETA 01:24:26
2022-07-21 15:21:27 [INFO]	[TRAIN] epoch: 1, iter: 4270/20000, loss: 0.3328, lr: 0.008057, batch_cost: 0.2818, reader_cost: 0.12974, ips: 14.1920 samples/sec | ETA 01:13:53
2022-07-21 15:21:30 [INFO]	[TRAIN] epoch: 1, iter: 4280/20000, loss: 0.4521, lr: 0.008052, batch_cost: 0.2502, reader_cost: 0.12727, ips: 15.9880 samples/sec | ETA 01:05:32
2022-07-21 15:21:32 [INFO]	[TRAIN] epoch: 1, iter: 4290/20000, loss: 0.4581, lr: 0.008047, batch_cost: 0.2867, reader_cost: 0.13596, ips: 13.9509 samples/sec | ETA 01:15:04
2022-07-21 15:21:36 [INFO]	[TRAIN] epoch: 1, iter: 4300/20000, loss: 0.3726, lr: 0.008043, batch_cost: 0.3300, reader_cost: 0.16152, ips: 12.1207 samples/sec | ETA 01:26:21
2022-07-21 15:21:38 [INFO]	[TRAIN] epoch: 1, iter: 4310/20000, loss: 0.6045, lr: 0.008038, batch_cost: 0.2383, reader_cost: 0.11248, ips: 16.7838 samples/sec | ETA 01:02:19
2022-07-21 15:21:40 [INFO]	[TRAIN] epoch: 1, iter: 4320/20000, loss: 0.4439, lr: 0.008034, batch_cost: 0.2365, reader_cost: 0.11470, ips: 16.9160 samples/sec | ETA 01:01:47
2022-07-21 15:21:43 [INFO]	[TRAIN] epoch: 1, iter: 4330/20000, loss: 0.3517, lr: 0.008029, batch_cost: 0.2284, reader_cost: 0.10412, ips: 17.5161 samples/sec | ETA 00:59:38
2022-07-21 15:21:45 [INFO]	[TRAIN] epoch: 1, iter: 4340/20000, loss: 0.5329, lr: 0.008024, batch_cost: 0.2268, reader_cost: 0.10427, ips: 17.6353 samples/sec | ETA 00:59:11
2022-07-21 15:21:47 [INFO]	[TRAIN] epoch: 1, iter: 4350/20000, loss: 0.3495, lr: 0.008020, batch_cost: 0.2364, reader_cost: 0.11460, ips: 16.9189 samples/sec | ETA 01:01:40
2022-07-21 15:21:50 [INFO]	[TRAIN] epoch: 1, iter: 4360/20000, loss: 0.5058, lr: 0.008015, batch_cost: 0.2537, reader_cost: 0.13252, ips: 15.7667 samples/sec | ETA 01:06:07
2022-07-21 15:21:52 [INFO]	[TRAIN] epoch: 1, iter: 4370/20000, loss: 0.3799, lr: 0.008011, batch_cost: 0.2418, reader_cost: 0.11095, ips: 16.5424 samples/sec | ETA 01:02:59
2022-07-21 15:21:55 [INFO]	[TRAIN] epoch: 1, iter: 4380/20000, loss: 0.5155, lr: 0.008006, batch_cost: 0.2774, reader_cost: 0.12058, ips: 14.4204 samples/sec | ETA 01:12:12
2022-07-21 15:21:58 [INFO]	[TRAIN] epoch: 1, iter: 4390/20000, loss: 0.4763, lr: 0.008001, batch_cost: 0.2552, reader_cost: 0.11810, ips: 15.6755 samples/sec | ETA 01:06:23
2022-07-21 15:22:00 [INFO]	[TRAIN] epoch: 1, iter: 4400/20000, loss: 0.4425, lr: 0.007997, batch_cost: 0.2436, reader_cost: 0.12187, ips: 16.4211 samples/sec | ETA 01:03:19
2022-07-21 15:22:02 [INFO]	[TRAIN] epoch: 1, iter: 4410/20000, loss: 0.5651, lr: 0.007992, batch_cost: 0.2346, reader_cost: 0.11142, ips: 17.0487 samples/sec | ETA 01:00:57
2022-07-21 15:22:05 [INFO]	[TRAIN] epoch: 1, iter: 4420/20000, loss: 0.3894, lr: 0.007987, batch_cost: 0.2465, reader_cost: 0.12326, ips: 16.2270 samples/sec | ETA 01:04:00
2022-07-21 15:22:07 [INFO]	[TRAIN] epoch: 1, iter: 4430/20000, loss: 0.4367, lr: 0.007983, batch_cost: 0.2346, reader_cost: 0.11006, ips: 17.0527 samples/sec | ETA 01:00:52
2022-07-21 15:22:10 [INFO]	[TRAIN] epoch: 1, iter: 4440/20000, loss: 0.2988, lr: 0.007978, batch_cost: 0.2385, reader_cost: 0.11422, ips: 16.7681 samples/sec | ETA 01:01:51
2022-07-21 15:22:12 [INFO]	[TRAIN] epoch: 1, iter: 4450/20000, loss: 0.4290, lr: 0.007974, batch_cost: 0.2541, reader_cost: 0.12599, ips: 15.7427 samples/sec | ETA 01:05:51
2022-07-21 15:22:14 [INFO]	[TRAIN] epoch: 1, iter: 4460/20000, loss: 0.3366, lr: 0.007969, batch_cost: 0.2334, reader_cost: 0.11216, ips: 17.1343 samples/sec | ETA 01:00:27
2022-07-21 15:22:17 [INFO]	[TRAIN] epoch: 1, iter: 4470/20000, loss: 0.4225, lr: 0.007964, batch_cost: 0.2452, reader_cost: 0.12210, ips: 16.3165 samples/sec | ETA 01:03:27
2022-07-21 15:22:19 [INFO]	[TRAIN] epoch: 1, iter: 4480/20000, loss: 0.4718, lr: 0.007960, batch_cost: 0.2474, reader_cost: 0.12182, ips: 16.1667 samples/sec | ETA 01:03:59
2022-07-21 15:22:22 [INFO]	[TRAIN] epoch: 1, iter: 4490/20000, loss: 0.6429, lr: 0.007955, batch_cost: 0.2372, reader_cost: 0.11396, ips: 16.8617 samples/sec | ETA 01:01:19
2022-07-21 15:22:24 [INFO]	[TRAIN] epoch: 1, iter: 4500/20000, loss: 0.3741, lr: 0.007951, batch_cost: 0.2342, reader_cost: 0.11046, ips: 17.0814 samples/sec | ETA 01:00:29
2022-07-21 15:22:27 [INFO]	[TRAIN] epoch: 1, iter: 4510/20000, loss: 0.3108, lr: 0.007946, batch_cost: 0.2546, reader_cost: 0.12051, ips: 15.7129 samples/sec | ETA 01:05:43
2022-07-21 15:22:30 [INFO]	[TRAIN] epoch: 1, iter: 4520/20000, loss: 0.4806, lr: 0.007941, batch_cost: 0.3030, reader_cost: 0.13438, ips: 13.2015 samples/sec | ETA 01:18:10
2022-07-21 15:22:32 [INFO]	[TRAIN] epoch: 1, iter: 4530/20000, loss: 0.4332, lr: 0.007937, batch_cost: 0.2358, reader_cost: 0.11306, ips: 16.9655 samples/sec | ETA 01:00:47
2022-07-21 15:22:34 [INFO]	[TRAIN] epoch: 1, iter: 4540/20000, loss: 0.3897, lr: 0.007932, batch_cost: 0.2113, reader_cost: 0.08985, ips: 18.9276 samples/sec | ETA 00:54:27
2022-07-21 15:22:37 [INFO]	[TRAIN] epoch: 1, iter: 4550/20000, loss: 0.5403, lr: 0.007927, batch_cost: 0.2399, reader_cost: 0.11759, ips: 16.6760 samples/sec | ETA 01:01:45
2022-07-21 15:22:40 [INFO]	[TRAIN] epoch: 1, iter: 4560/20000, loss: 0.4583, lr: 0.007923, batch_cost: 0.3525, reader_cost: 0.17278, ips: 11.3465 samples/sec | ETA 01:30:43
2022-07-21 15:22:43 [INFO]	[TRAIN] epoch: 1, iter: 4570/20000, loss: 0.4106, lr: 0.007918, batch_cost: 0.2860, reader_cost: 0.13546, ips: 13.9867 samples/sec | ETA 01:13:32
2022-07-21 15:22:45 [INFO]	[TRAIN] epoch: 1, iter: 4580/20000, loss: 0.4482, lr: 0.007914, batch_cost: 0.2330, reader_cost: 0.10870, ips: 17.1683 samples/sec | ETA 00:59:52
2022-07-21 15:22:48 [INFO]	[TRAIN] epoch: 1, iter: 4590/20000, loss: 0.3926, lr: 0.007909, batch_cost: 0.2403, reader_cost: 0.11609, ips: 16.6447 samples/sec | ETA 01:01:43
2022-07-21 15:22:50 [INFO]	[TRAIN] epoch: 1, iter: 4600/20000, loss: 0.4208, lr: 0.007904, batch_cost: 0.2475, reader_cost: 0.12371, ips: 16.1630 samples/sec | ETA 01:03:31
2022-07-21 15:22:53 [INFO]	[TRAIN] epoch: 1, iter: 4610/20000, loss: 0.3772, lr: 0.007900, batch_cost: 0.2413, reader_cost: 0.11739, ips: 16.5758 samples/sec | ETA 01:01:53
2022-07-21 15:22:55 [INFO]	[TRAIN] epoch: 1, iter: 4620/20000, loss: 0.4719, lr: 0.007895, batch_cost: 0.2432, reader_cost: 0.12082, ips: 16.4495 samples/sec | ETA 01:02:19
2022-07-21 15:22:57 [INFO]	[TRAIN] epoch: 1, iter: 4630/20000, loss: 0.3381, lr: 0.007891, batch_cost: 0.2352, reader_cost: 0.10806, ips: 17.0055 samples/sec | ETA 01:00:15
2022-07-21 15:23:00 [INFO]	[TRAIN] epoch: 1, iter: 4640/20000, loss: 0.3651, lr: 0.007886, batch_cost: 0.3026, reader_cost: 0.15394, ips: 13.2187 samples/sec | ETA 01:17:27
2022-07-21 15:23:03 [INFO]	[TRAIN] epoch: 1, iter: 4650/20000, loss: 0.3644, lr: 0.007881, batch_cost: 0.2598, reader_cost: 0.13391, ips: 15.3975 samples/sec | ETA 01:06:27
2022-07-21 15:23:05 [INFO]	[TRAIN] epoch: 1, iter: 4660/20000, loss: 0.3640, lr: 0.007877, batch_cost: 0.2270, reader_cost: 0.10084, ips: 17.6210 samples/sec | ETA 00:58:02
2022-07-21 15:23:08 [INFO]	[TRAIN] epoch: 1, iter: 4670/20000, loss: 0.3817, lr: 0.007872, batch_cost: 0.2335, reader_cost: 0.11240, ips: 17.1295 samples/sec | ETA 00:59:39
2022-07-21 15:23:10 [INFO]	[TRAIN] epoch: 1, iter: 4680/20000, loss: 0.3787, lr: 0.007867, batch_cost: 0.2304, reader_cost: 0.10769, ips: 17.3606 samples/sec | ETA 00:58:49
2022-07-21 15:23:12 [INFO]	[TRAIN] epoch: 1, iter: 4690/20000, loss: 0.5417, lr: 0.007863, batch_cost: 0.2359, reader_cost: 0.10951, ips: 16.9552 samples/sec | ETA 01:00:11
2022-07-21 15:23:15 [INFO]	[TRAIN] epoch: 1, iter: 4700/20000, loss: 0.5064, lr: 0.007858, batch_cost: 0.2376, reader_cost: 0.11494, ips: 16.8332 samples/sec | ETA 01:00:35
2022-07-21 15:23:17 [INFO]	[TRAIN] epoch: 1, iter: 4710/20000, loss: 0.3518, lr: 0.007854, batch_cost: 0.2593, reader_cost: 0.13122, ips: 15.4260 samples/sec | ETA 01:06:04
2022-07-21 15:23:20 [INFO]	[TRAIN] epoch: 1, iter: 4720/20000, loss: 0.4202, lr: 0.007849, batch_cost: 0.2330, reader_cost: 0.10972, ips: 17.1640 samples/sec | ETA 00:59:20
2022-07-21 15:23:22 [INFO]	[TRAIN] epoch: 1, iter: 4730/20000, loss: 0.3870, lr: 0.007844, batch_cost: 0.2290, reader_cost: 0.10748, ips: 17.4673 samples/sec | ETA 00:58:16
2022-07-21 15:23:24 [INFO]	[TRAIN] epoch: 1, iter: 4740/20000, loss: 0.4262, lr: 0.007840, batch_cost: 0.2359, reader_cost: 0.11272, ips: 16.9558 samples/sec | ETA 00:59:59
2022-07-21 15:23:27 [INFO]	[TRAIN] epoch: 1, iter: 4750/20000, loss: 0.4623, lr: 0.007835, batch_cost: 0.2536, reader_cost: 0.12991, ips: 15.7749 samples/sec | ETA 01:04:26
2022-07-21 15:23:29 [INFO]	[TRAIN] epoch: 1, iter: 4760/20000, loss: 0.3557, lr: 0.007830, batch_cost: 0.2325, reader_cost: 0.10936, ips: 17.2047 samples/sec | ETA 00:59:03
2022-07-21 15:23:33 [INFO]	[TRAIN] epoch: 1, iter: 4770/20000, loss: 0.3968, lr: 0.007826, batch_cost: 0.3587, reader_cost: 0.19119, ips: 11.1526 samples/sec | ETA 01:31:02
2022-07-21 15:23:35 [INFO]	[TRAIN] epoch: 1, iter: 4780/20000, loss: 0.3876, lr: 0.007821, batch_cost: 0.2411, reader_cost: 0.11723, ips: 16.5895 samples/sec | ETA 01:01:09
2022-07-21 15:23:37 [INFO]	[TRAIN] epoch: 1, iter: 4790/20000, loss: 0.4948, lr: 0.007817, batch_cost: 0.2378, reader_cost: 0.11361, ips: 16.8206 samples/sec | ETA 01:00:16
2022-07-21 15:23:40 [INFO]	[TRAIN] epoch: 1, iter: 4800/20000, loss: 0.4025, lr: 0.007812, batch_cost: 0.2321, reader_cost: 0.11236, ips: 17.2319 samples/sec | ETA 00:58:48
2022-07-21 15:23:42 [INFO]	[TRAIN] epoch: 1, iter: 4810/20000, loss: 0.4704, lr: 0.007807, batch_cost: 0.2390, reader_cost: 0.11306, ips: 16.7381 samples/sec | ETA 01:00:30
2022-07-21 15:23:45 [INFO]	[TRAIN] epoch: 1, iter: 4820/20000, loss: 0.4029, lr: 0.007803, batch_cost: 0.2696, reader_cost: 0.12564, ips: 14.8395 samples/sec | ETA 01:08:11
2022-07-21 15:23:49 [INFO]	[TRAIN] epoch: 1, iter: 4830/20000, loss: 0.3120, lr: 0.007798, batch_cost: 0.3699, reader_cost: 0.19596, ips: 10.8132 samples/sec | ETA 01:33:31
2022-07-21 15:23:51 [INFO]	[TRAIN] epoch: 1, iter: 4840/20000, loss: 0.3889, lr: 0.007793, batch_cost: 0.2478, reader_cost: 0.11739, ips: 16.1434 samples/sec | ETA 01:02:36
2022-07-21 15:23:53 [INFO]	[TRAIN] epoch: 1, iter: 4850/20000, loss: 0.4272, lr: 0.007789, batch_cost: 0.2416, reader_cost: 0.11928, ips: 16.5564 samples/sec | ETA 01:01:00
2022-07-21 15:23:56 [INFO]	[TRAIN] epoch: 1, iter: 4860/20000, loss: 0.3627, lr: 0.007784, batch_cost: 0.2444, reader_cost: 0.11907, ips: 16.3670 samples/sec | ETA 01:01:40
2022-07-21 15:23:59 [INFO]	[TRAIN] epoch: 1, iter: 4870/20000, loss: 0.2982, lr: 0.007780, batch_cost: 0.2653, reader_cost: 0.14112, ips: 15.0777 samples/sec | ETA 01:06:53
2022-07-21 15:24:01 [INFO]	[TRAIN] epoch: 1, iter: 4880/20000, loss: 0.4014, lr: 0.007775, batch_cost: 0.2387, reader_cost: 0.11632, ips: 16.7559 samples/sec | ETA 01:00:09
2022-07-21 15:24:04 [INFO]	[TRAIN] epoch: 1, iter: 4890/20000, loss: 0.3431, lr: 0.007770, batch_cost: 0.2816, reader_cost: 0.14058, ips: 14.2043 samples/sec | ETA 01:10:55
2022-07-21 15:24:07 [INFO]	[TRAIN] epoch: 1, iter: 4900/20000, loss: 0.4587, lr: 0.007766, batch_cost: 0.2847, reader_cost: 0.13325, ips: 14.0504 samples/sec | ETA 01:11:38
2022-07-21 15:24:09 [INFO]	[TRAIN] epoch: 1, iter: 4910/20000, loss: 0.4679, lr: 0.007761, batch_cost: 0.2309, reader_cost: 0.10332, ips: 17.3207 samples/sec | ETA 00:58:04
2022-07-21 15:24:11 [INFO]	[TRAIN] epoch: 1, iter: 4920/20000, loss: 0.4062, lr: 0.007756, batch_cost: 0.2365, reader_cost: 0.11466, ips: 16.9165 samples/sec | ETA 00:59:25
2022-07-21 15:24:14 [INFO]	[TRAIN] epoch: 1, iter: 4930/20000, loss: 0.4924, lr: 0.007752, batch_cost: 0.2490, reader_cost: 0.12430, ips: 16.0635 samples/sec | ETA 01:02:32
2022-07-21 15:24:16 [INFO]	[TRAIN] epoch: 1, iter: 4940/20000, loss: 0.3202, lr: 0.007747, batch_cost: 0.2442, reader_cost: 0.11640, ips: 16.3817 samples/sec | ETA 01:01:17
2022-07-21 15:24:19 [INFO]	[TRAIN] epoch: 1, iter: 4950/20000, loss: 0.3110, lr: 0.007743, batch_cost: 0.2419, reader_cost: 0.11826, ips: 16.5386 samples/sec | ETA 01:00:39
2022-07-21 15:24:21 [INFO]	[TRAIN] epoch: 2, iter: 4960/20000, loss: 0.3453, lr: 0.007738, batch_cost: 0.2513, reader_cost: 0.12764, ips: 15.9186 samples/sec | ETA 01:02:59
2022-07-21 15:24:24 [INFO]	[TRAIN] epoch: 2, iter: 4970/20000, loss: 0.3896, lr: 0.007733, batch_cost: 0.2472, reader_cost: 0.12488, ips: 16.1810 samples/sec | ETA 01:01:55
2022-07-21 15:24:26 [INFO]	[TRAIN] epoch: 2, iter: 4980/20000, loss: 0.3915, lr: 0.007729, batch_cost: 0.2407, reader_cost: 0.11640, ips: 16.6167 samples/sec | ETA 01:00:15
2022-07-21 15:24:28 [INFO]	[TRAIN] epoch: 2, iter: 4990/20000, loss: 0.3838, lr: 0.007724, batch_cost: 0.2362, reader_cost: 0.11349, ips: 16.9383 samples/sec | ETA 00:59:04
2022-07-21 15:24:31 [INFO]	[TRAIN] epoch: 2, iter: 5000/20000, loss: 0.4880, lr: 0.007719, batch_cost: 0.2301, reader_cost: 0.10802, ips: 17.3823 samples/sec | ETA 00:57:31
2022-07-21 15:24:33 [INFO]	[TRAIN] epoch: 2, iter: 5010/20000, loss: 0.5577, lr: 0.007715, batch_cost: 0.2503, reader_cost: 0.11797, ips: 15.9839 samples/sec | ETA 01:02:31
2022-07-21 15:24:36 [INFO]	[TRAIN] epoch: 2, iter: 5020/20000, loss: 0.4198, lr: 0.007710, batch_cost: 0.3103, reader_cost: 0.15212, ips: 12.8890 samples/sec | ETA 01:17:28
2022-07-21 15:24:39 [INFO]	[TRAIN] epoch: 2, iter: 5030/20000, loss: 0.4269, lr: 0.007705, batch_cost: 0.2253, reader_cost: 0.10158, ips: 17.7536 samples/sec | ETA 00:56:12
2022-07-21 15:24:41 [INFO]	[TRAIN] epoch: 2, iter: 5040/20000, loss: 0.4499, lr: 0.007701, batch_cost: 0.2338, reader_cost: 0.10715, ips: 17.1055 samples/sec | ETA 00:58:18
2022-07-21 15:24:43 [INFO]	[TRAIN] epoch: 2, iter: 5050/20000, loss: 0.5697, lr: 0.007696, batch_cost: 0.2442, reader_cost: 0.11889, ips: 16.3796 samples/sec | ETA 01:00:50
2022-07-21 15:24:46 [INFO]	[TRAIN] epoch: 2, iter: 5060/20000, loss: 0.3750, lr: 0.007692, batch_cost: 0.2414, reader_cost: 0.11405, ips: 16.5717 samples/sec | ETA 01:00:06
2022-07-21 15:24:48 [INFO]	[TRAIN] epoch: 2, iter: 5070/20000, loss: 0.3640, lr: 0.007687, batch_cost: 0.2328, reader_cost: 0.11094, ips: 17.1837 samples/sec | ETA 00:57:55
2022-07-21 15:24:51 [INFO]	[TRAIN] epoch: 2, iter: 5080/20000, loss: 0.4524, lr: 0.007682, batch_cost: 0.2508, reader_cost: 0.11668, ips: 15.9501 samples/sec | ETA 01:02:21
2022-07-21 15:24:54 [INFO]	[TRAIN] epoch: 2, iter: 5090/20000, loss: 0.4370, lr: 0.007678, batch_cost: 0.3474, reader_cost: 0.16383, ips: 11.5148 samples/sec | ETA 01:26:19
2022-07-21 15:24:57 [INFO]	[TRAIN] epoch: 2, iter: 5100/20000, loss: 0.4431, lr: 0.007673, batch_cost: 0.2545, reader_cost: 0.11567, ips: 15.7173 samples/sec | ETA 01:03:12
2022-07-21 15:24:59 [INFO]	[TRAIN] epoch: 2, iter: 5110/20000, loss: 0.4494, lr: 0.007668, batch_cost: 0.2393, reader_cost: 0.11772, ips: 16.7159 samples/sec | ETA 00:59:23
2022-07-21 15:25:01 [INFO]	[TRAIN] epoch: 2, iter: 5120/20000, loss: 0.3455, lr: 0.007664, batch_cost: 0.2320, reader_cost: 0.10694, ips: 17.2451 samples/sec | ETA 00:57:31
2022-07-21 15:25:04 [INFO]	[TRAIN] epoch: 2, iter: 5130/20000, loss: 0.4059, lr: 0.007659, batch_cost: 0.2335, reader_cost: 0.11132, ips: 17.1300 samples/sec | ETA 00:57:52
2022-07-21 15:25:06 [INFO]	[TRAIN] epoch: 2, iter: 5140/20000, loss: 0.3808, lr: 0.007654, batch_cost: 0.2498, reader_cost: 0.11316, ips: 16.0106 samples/sec | ETA 01:01:52
2022-07-21 15:25:09 [INFO]	[TRAIN] epoch: 2, iter: 5150/20000, loss: 0.3546, lr: 0.007650, batch_cost: 0.3274, reader_cost: 0.17860, ips: 12.2160 samples/sec | ETA 01:21:02
2022-07-21 15:25:12 [INFO]	[TRAIN] epoch: 2, iter: 5160/20000, loss: 0.5153, lr: 0.007645, batch_cost: 0.2604, reader_cost: 0.12662, ips: 15.3600 samples/sec | ETA 01:04:24
2022-07-21 15:25:14 [INFO]	[TRAIN] epoch: 2, iter: 5170/20000, loss: 0.3590, lr: 0.007641, batch_cost: 0.2227, reader_cost: 0.08812, ips: 17.9603 samples/sec | ETA 00:55:02
2022-07-21 15:25:17 [INFO]	[TRAIN] epoch: 2, iter: 5180/20000, loss: 0.4540, lr: 0.007636, batch_cost: 0.2344, reader_cost: 0.10723, ips: 17.0627 samples/sec | ETA 00:57:54
2022-07-21 15:25:19 [INFO]	[TRAIN] epoch: 2, iter: 5190/20000, loss: 0.3632, lr: 0.007631, batch_cost: 0.2203, reader_cost: 0.09800, ips: 18.1534 samples/sec | ETA 00:54:23
2022-07-21 15:25:21 [INFO]	[TRAIN] epoch: 2, iter: 5200/20000, loss: 0.3836, lr: 0.007627, batch_cost: 0.2218, reader_cost: 0.09968, ips: 18.0377 samples/sec | ETA 00:54:42
2022-07-21 15:25:24 [INFO]	[TRAIN] epoch: 2, iter: 5210/20000, loss: 0.4591, lr: 0.007622, batch_cost: 0.2462, reader_cost: 0.12128, ips: 16.2443 samples/sec | ETA 01:00:41
2022-07-21 15:25:26 [INFO]	[TRAIN] epoch: 2, iter: 5220/20000, loss: 0.4158, lr: 0.007617, batch_cost: 0.2339, reader_cost: 0.10969, ips: 17.1016 samples/sec | ETA 00:57:36
2022-07-21 15:25:28 [INFO]	[TRAIN] epoch: 2, iter: 5230/20000, loss: 0.4235, lr: 0.007613, batch_cost: 0.2411, reader_cost: 0.11464, ips: 16.5930 samples/sec | ETA 00:59:20
2022-07-21 15:25:31 [INFO]	[TRAIN] epoch: 2, iter: 5240/20000, loss: 0.4470, lr: 0.007608, batch_cost: 0.2321, reader_cost: 0.10978, ips: 17.2309 samples/sec | ETA 00:57:06
2022-07-21 15:25:33 [INFO]	[TRAIN] epoch: 2, iter: 5250/20000, loss: 0.3240, lr: 0.007603, batch_cost: 0.2226, reader_cost: 0.09752, ips: 17.9702 samples/sec | ETA 00:54:43
2022-07-21 15:25:35 [INFO]	[TRAIN] epoch: 2, iter: 5260/20000, loss: 0.3996, lr: 0.007599, batch_cost: 0.2412, reader_cost: 0.11783, ips: 16.5856 samples/sec | ETA 00:59:14
2022-07-21 15:25:38 [INFO]	[TRAIN] epoch: 2, iter: 5270/20000, loss: 0.4645, lr: 0.007594, batch_cost: 0.2768, reader_cost: 0.12642, ips: 14.4506 samples/sec | ETA 01:07:57
2022-07-21 15:25:41 [INFO]	[TRAIN] epoch: 2, iter: 5280/20000, loss: 0.3042, lr: 0.007590, batch_cost: 0.2855, reader_cost: 0.15084, ips: 14.0118 samples/sec | ETA 01:10:02
2022-07-21 15:25:43 [INFO]	[TRAIN] epoch: 2, iter: 5290/20000, loss: 0.4297, lr: 0.007585, batch_cost: 0.2351, reader_cost: 0.11156, ips: 17.0176 samples/sec | ETA 00:57:37
2022-07-21 15:25:46 [INFO]	[TRAIN] epoch: 2, iter: 5300/20000, loss: 0.3662, lr: 0.007580, batch_cost: 0.2427, reader_cost: 0.11558, ips: 16.4800 samples/sec | ETA 00:59:27
2022-07-21 15:25:48 [INFO]	[TRAIN] epoch: 2, iter: 5310/20000, loss: 0.5296, lr: 0.007576, batch_cost: 0.2444, reader_cost: 0.11804, ips: 16.3650 samples/sec | ETA 00:59:50
2022-07-21 15:25:50 [INFO]	[TRAIN] epoch: 2, iter: 5320/20000, loss: 0.4344, lr: 0.007571, batch_cost: 0.2142, reader_cost: 0.09279, ips: 18.6700 samples/sec | ETA 00:52:25
2022-07-21 15:25:53 [INFO]	[TRAIN] epoch: 2, iter: 5330/20000, loss: 0.3390, lr: 0.007566, batch_cost: 0.2476, reader_cost: 0.12470, ips: 16.1583 samples/sec | ETA 01:00:31
2022-07-21 15:25:55 [INFO]	[TRAIN] epoch: 2, iter: 5340/20000, loss: 0.3630, lr: 0.007562, batch_cost: 0.2242, reader_cost: 0.10039, ips: 17.8440 samples/sec | ETA 00:54:46
2022-07-21 15:25:58 [INFO]	[TRAIN] epoch: 2, iter: 5350/20000, loss: 0.4521, lr: 0.007557, batch_cost: 0.2557, reader_cost: 0.11479, ips: 15.6448 samples/sec | ETA 01:02:25
2022-07-21 15:26:01 [INFO]	[TRAIN] epoch: 2, iter: 5360/20000, loss: 0.3585, lr: 0.007552, batch_cost: 0.3688, reader_cost: 0.16836, ips: 10.8465 samples/sec | ETA 01:29:58
2022-07-21 15:26:04 [INFO]	[TRAIN] epoch: 2, iter: 5370/20000, loss: 0.4727, lr: 0.007548, batch_cost: 0.2452, reader_cost: 0.10883, ips: 16.3104 samples/sec | ETA 00:59:47
2022-07-21 15:26:06 [INFO]	[TRAIN] epoch: 2, iter: 5380/20000, loss: 0.3542, lr: 0.007543, batch_cost: 0.2349, reader_cost: 0.11125, ips: 17.0274 samples/sec | ETA 00:57:14
2022-07-21 15:26:08 [INFO]	[TRAIN] epoch: 2, iter: 5390/20000, loss: 0.4911, lr: 0.007539, batch_cost: 0.2267, reader_cost: 0.10391, ips: 17.6426 samples/sec | ETA 00:55:12
2022-07-21 15:26:11 [INFO]	[TRAIN] epoch: 2, iter: 5400/20000, loss: 0.4155, lr: 0.007534, batch_cost: 0.2520, reader_cost: 0.11143, ips: 15.8726 samples/sec | ETA 01:01:19
2022-07-21 15:26:13 [INFO]	[TRAIN] epoch: 2, iter: 5410/20000, loss: 0.3013, lr: 0.007529, batch_cost: 0.2611, reader_cost: 0.13148, ips: 15.3219 samples/sec | ETA 01:03:28
2022-07-21 15:26:16 [INFO]	[TRAIN] epoch: 2, iter: 5420/20000, loss: 0.3551, lr: 0.007525, batch_cost: 0.2447, reader_cost: 0.11435, ips: 16.3460 samples/sec | ETA 00:59:27
2022-07-21 15:26:18 [INFO]	[TRAIN] epoch: 2, iter: 5430/20000, loss: 0.3183, lr: 0.007520, batch_cost: 0.2455, reader_cost: 0.11718, ips: 16.2943 samples/sec | ETA 00:59:36
2022-07-21 15:26:21 [INFO]	[TRAIN] epoch: 2, iter: 5440/20000, loss: 0.4429, lr: 0.007515, batch_cost: 0.2323, reader_cost: 0.10807, ips: 17.2157 samples/sec | ETA 00:56:22
2022-07-21 15:26:23 [INFO]	[TRAIN] epoch: 2, iter: 5450/20000, loss: 0.3599, lr: 0.007511, batch_cost: 0.2250, reader_cost: 0.10202, ips: 17.7795 samples/sec | ETA 00:54:33
2022-07-21 15:26:25 [INFO]	[TRAIN] epoch: 2, iter: 5460/20000, loss: 0.3519, lr: 0.007506, batch_cost: 0.2444, reader_cost: 0.12107, ips: 16.3698 samples/sec | ETA 00:59:12
2022-07-21 15:26:28 [INFO]	[TRAIN] epoch: 2, iter: 5470/20000, loss: 0.3930, lr: 0.007501, batch_cost: 0.2445, reader_cost: 0.12044, ips: 16.3598 samples/sec | ETA 00:59:12
2022-07-21 15:26:30 [INFO]	[TRAIN] epoch: 2, iter: 5480/20000, loss: 0.5024, lr: 0.007497, batch_cost: 0.2230, reader_cost: 0.10131, ips: 17.9393 samples/sec | ETA 00:53:57
2022-07-21 15:26:32 [INFO]	[TRAIN] epoch: 2, iter: 5490/20000, loss: 0.4301, lr: 0.007492, batch_cost: 0.2368, reader_cost: 0.11337, ips: 16.8949 samples/sec | ETA 00:57:15
2022-07-21 15:26:35 [INFO]	[TRAIN] epoch: 2, iter: 5500/20000, loss: 0.3975, lr: 0.007487, batch_cost: 0.2280, reader_cost: 0.10417, ips: 17.5463 samples/sec | ETA 00:55:05
2022-07-21 15:26:37 [INFO]	[TRAIN] epoch: 2, iter: 5510/20000, loss: 0.3594, lr: 0.007483, batch_cost: 0.2288, reader_cost: 0.10609, ips: 17.4852 samples/sec | ETA 00:55:14
2022-07-21 15:26:39 [INFO]	[TRAIN] epoch: 2, iter: 5520/20000, loss: 0.5003, lr: 0.007478, batch_cost: 0.2213, reader_cost: 0.09958, ips: 18.0768 samples/sec | ETA 00:53:24
2022-07-21 15:26:42 [INFO]	[TRAIN] epoch: 2, iter: 5530/20000, loss: 0.3434, lr: 0.007473, batch_cost: 0.3236, reader_cost: 0.14455, ips: 12.3601 samples/sec | ETA 01:18:02
2022-07-21 15:26:45 [INFO]	[TRAIN] epoch: 2, iter: 5540/20000, loss: 0.3275, lr: 0.007469, batch_cost: 0.2583, reader_cost: 0.12248, ips: 15.4860 samples/sec | ETA 01:02:14
2022-07-21 15:26:47 [INFO]	[TRAIN] epoch: 2, iter: 5550/20000, loss: 0.4840, lr: 0.007464, batch_cost: 0.2243, reader_cost: 0.10117, ips: 17.8333 samples/sec | ETA 00:54:01
2022-07-21 15:26:50 [INFO]	[TRAIN] epoch: 2, iter: 5560/20000, loss: 0.3587, lr: 0.007460, batch_cost: 0.2352, reader_cost: 0.11081, ips: 17.0037 samples/sec | ETA 00:56:36
2022-07-21 15:26:52 [INFO]	[TRAIN] epoch: 2, iter: 5570/20000, loss: 0.3864, lr: 0.007455, batch_cost: 0.2296, reader_cost: 0.10765, ips: 17.4183 samples/sec | ETA 00:55:13
2022-07-21 15:26:54 [INFO]	[TRAIN] epoch: 2, iter: 5580/20000, loss: 0.4177, lr: 0.007450, batch_cost: 0.2401, reader_cost: 0.11699, ips: 16.6621 samples/sec | ETA 00:57:41
2022-07-21 15:26:57 [INFO]	[TRAIN] epoch: 2, iter: 5590/20000, loss: 0.4842, lr: 0.007446, batch_cost: 0.2355, reader_cost: 0.11072, ips: 16.9887 samples/sec | ETA 00:56:32
2022-07-21 15:26:59 [INFO]	[TRAIN] epoch: 2, iter: 5600/20000, loss: 0.3961, lr: 0.007441, batch_cost: 0.2326, reader_cost: 0.10909, ips: 17.1977 samples/sec | ETA 00:55:49
2022-07-21 15:27:01 [INFO]	[TRAIN] epoch: 2, iter: 5610/20000, loss: 0.4291, lr: 0.007436, batch_cost: 0.2310, reader_cost: 0.10726, ips: 17.3165 samples/sec | ETA 00:55:23
2022-07-21 15:27:04 [INFO]	[TRAIN] epoch: 2, iter: 5620/20000, loss: 0.3863, lr: 0.007432, batch_cost: 0.2424, reader_cost: 0.11046, ips: 16.5032 samples/sec | ETA 00:58:05
2022-07-21 15:27:07 [INFO]	[TRAIN] epoch: 2, iter: 5630/20000, loss: 0.3990, lr: 0.007427, batch_cost: 0.3402, reader_cost: 0.17431, ips: 11.7574 samples/sec | ETA 01:21:28
2022-07-21 15:27:10 [INFO]	[TRAIN] epoch: 2, iter: 5640/20000, loss: 0.5615, lr: 0.007422, batch_cost: 0.2654, reader_cost: 0.12106, ips: 15.0739 samples/sec | ETA 01:03:30
2022-07-21 15:27:12 [INFO]	[TRAIN] epoch: 2, iter: 5650/20000, loss: 0.3362, lr: 0.007418, batch_cost: 0.2141, reader_cost: 0.08888, ips: 18.6825 samples/sec | ETA 00:51:12
2022-07-21 15:27:14 [INFO]	[TRAIN] epoch: 2, iter: 5660/20000, loss: 0.2867, lr: 0.007413, batch_cost: 0.2606, reader_cost: 0.11749, ips: 15.3479 samples/sec | ETA 01:02:17
2022-07-21 15:27:17 [INFO]	[TRAIN] epoch: 2, iter: 5670/20000, loss: 0.3442, lr: 0.007408, batch_cost: 0.2526, reader_cost: 0.12789, ips: 15.8345 samples/sec | ETA 01:00:19
2022-07-21 15:27:20 [INFO]	[TRAIN] epoch: 2, iter: 5680/20000, loss: 0.3033, lr: 0.007404, batch_cost: 0.2541, reader_cost: 0.12751, ips: 15.7405 samples/sec | ETA 01:00:39
2022-07-21 15:27:22 [INFO]	[TRAIN] epoch: 2, iter: 5690/20000, loss: 0.4931, lr: 0.007399, batch_cost: 0.2423, reader_cost: 0.10988, ips: 16.5063 samples/sec | ETA 00:57:47
2022-07-21 15:27:24 [INFO]	[TRAIN] epoch: 2, iter: 5700/20000, loss: 0.3469, lr: 0.007394, batch_cost: 0.2507, reader_cost: 0.11961, ips: 15.9561 samples/sec | ETA 00:59:44
2022-07-21 15:27:27 [INFO]	[TRAIN] epoch: 2, iter: 5710/20000, loss: 0.4016, lr: 0.007390, batch_cost: 0.2407, reader_cost: 0.11641, ips: 16.6188 samples/sec | ETA 00:57:19
2022-07-21 15:27:29 [INFO]	[TRAIN] epoch: 2, iter: 5720/20000, loss: 0.4506, lr: 0.007385, batch_cost: 0.2327, reader_cost: 0.10932, ips: 17.1871 samples/sec | ETA 00:55:23
2022-07-21 15:27:31 [INFO]	[TRAIN] epoch: 2, iter: 5730/20000, loss: 0.3409, lr: 0.007380, batch_cost: 0.2234, reader_cost: 0.10003, ips: 17.9068 samples/sec | ETA 00:53:07
2022-07-21 15:27:34 [INFO]	[TRAIN] epoch: 2, iter: 5740/20000, loss: 0.3458, lr: 0.007376, batch_cost: 0.2283, reader_cost: 0.10267, ips: 17.5220 samples/sec | ETA 00:54:15
2022-07-21 15:27:36 [INFO]	[TRAIN] epoch: 2, iter: 5750/20000, loss: 0.4088, lr: 0.007371, batch_cost: 0.2226, reader_cost: 0.09954, ips: 17.9681 samples/sec | ETA 00:52:52
2022-07-21 15:27:38 [INFO]	[TRAIN] epoch: 2, iter: 5760/20000, loss: 0.4165, lr: 0.007366, batch_cost: 0.2240, reader_cost: 0.10029, ips: 17.8596 samples/sec | ETA 00:53:09
2022-07-21 15:27:40 [INFO]	[TRAIN] epoch: 2, iter: 5770/20000, loss: 0.3951, lr: 0.007362, batch_cost: 0.2225, reader_cost: 0.09884, ips: 17.9736 samples/sec | ETA 00:52:46
2022-07-21 15:27:43 [INFO]	[TRAIN] epoch: 2, iter: 5780/20000, loss: 0.4514, lr: 0.007357, batch_cost: 0.2276, reader_cost: 0.10282, ips: 17.5747 samples/sec | ETA 00:53:56
2022-07-21 15:27:46 [INFO]	[TRAIN] epoch: 2, iter: 5790/20000, loss: 0.3179, lr: 0.007353, batch_cost: 0.3168, reader_cost: 0.15696, ips: 12.6277 samples/sec | ETA 01:15:01
2022-07-21 15:27:49 [INFO]	[TRAIN] epoch: 2, iter: 5800/20000, loss: 0.3213, lr: 0.007348, batch_cost: 0.2677, reader_cost: 0.12527, ips: 14.9409 samples/sec | ETA 01:03:21
2022-07-21 15:27:51 [INFO]	[TRAIN] epoch: 2, iter: 5810/20000, loss: 0.4012, lr: 0.007343, batch_cost: 0.2489, reader_cost: 0.12244, ips: 16.0678 samples/sec | ETA 00:58:52
2022-07-21 15:27:53 [INFO]	[TRAIN] epoch: 2, iter: 5820/20000, loss: 0.3272, lr: 0.007339, batch_cost: 0.2109, reader_cost: 0.08880, ips: 18.9657 samples/sec | ETA 00:49:50
2022-07-21 15:27:56 [INFO]	[TRAIN] epoch: 2, iter: 5830/20000, loss: 0.3543, lr: 0.007334, batch_cost: 0.2334, reader_cost: 0.10971, ips: 17.1361 samples/sec | ETA 00:55:07
2022-07-21 15:27:58 [INFO]	[TRAIN] epoch: 2, iter: 5840/20000, loss: 0.4056, lr: 0.007329, batch_cost: 0.2329, reader_cost: 0.11225, ips: 17.1719 samples/sec | ETA 00:54:58
2022-07-21 15:28:00 [INFO]	[TRAIN] epoch: 2, iter: 5850/20000, loss: 0.3293, lr: 0.007325, batch_cost: 0.2300, reader_cost: 0.10805, ips: 17.3876 samples/sec | ETA 00:54:15
2022-07-21 15:28:02 [INFO]	[TRAIN] epoch: 2, iter: 5860/20000, loss: 0.3254, lr: 0.007320, batch_cost: 0.2237, reader_cost: 0.09645, ips: 17.8810 samples/sec | ETA 00:52:43
2022-07-21 15:28:05 [INFO]	[TRAIN] epoch: 2, iter: 5870/20000, loss: 0.4162, lr: 0.007315, batch_cost: 0.2317, reader_cost: 0.10725, ips: 17.2670 samples/sec | ETA 00:54:33
2022-07-21 15:28:07 [INFO]	[TRAIN] epoch: 2, iter: 5880/20000, loss: 0.4265, lr: 0.007311, batch_cost: 0.2338, reader_cost: 0.11085, ips: 17.1059 samples/sec | ETA 00:55:01
2022-07-21 15:28:10 [INFO]	[TRAIN] epoch: 2, iter: 5890/20000, loss: 0.3579, lr: 0.007306, batch_cost: 0.2659, reader_cost: 0.13224, ips: 15.0448 samples/sec | ETA 01:02:31
2022-07-21 15:28:13 [INFO]	[TRAIN] epoch: 2, iter: 5900/20000, loss: 0.5291, lr: 0.007301, batch_cost: 0.3302, reader_cost: 0.15049, ips: 12.1154 samples/sec | ETA 01:17:35
2022-07-21 15:28:16 [INFO]	[TRAIN] epoch: 2, iter: 5910/20000, loss: 0.4302, lr: 0.007297, batch_cost: 0.2842, reader_cost: 0.14366, ips: 14.0762 samples/sec | ETA 01:06:43
2022-07-21 15:28:19 [INFO]	[TRAIN] epoch: 2, iter: 5920/20000, loss: 0.3429, lr: 0.007292, batch_cost: 0.2688, reader_cost: 0.12771, ips: 14.8808 samples/sec | ETA 01:03:04
2022-07-21 15:28:21 [INFO]	[TRAIN] epoch: 2, iter: 5930/20000, loss: 0.3605, lr: 0.007287, batch_cost: 0.2281, reader_cost: 0.10636, ips: 17.5348 samples/sec | ETA 00:53:29
2022-07-21 15:28:23 [INFO]	[TRAIN] epoch: 2, iter: 5940/20000, loss: 0.4836, lr: 0.007283, batch_cost: 0.2196, reader_cost: 0.09411, ips: 18.2128 samples/sec | ETA 00:51:27
2022-07-21 15:28:25 [INFO]	[TRAIN] epoch: 2, iter: 5950/20000, loss: 0.3144, lr: 0.007278, batch_cost: 0.2390, reader_cost: 0.11562, ips: 16.7351 samples/sec | ETA 00:55:58
2022-07-21 15:28:28 [INFO]	[TRAIN] epoch: 2, iter: 5960/20000, loss: 0.4308, lr: 0.007273, batch_cost: 0.2524, reader_cost: 0.12208, ips: 15.8505 samples/sec | ETA 00:59:03
2022-07-21 15:28:30 [INFO]	[TRAIN] epoch: 2, iter: 5970/20000, loss: 0.5022, lr: 0.007269, batch_cost: 0.2546, reader_cost: 0.12120, ips: 15.7133 samples/sec | ETA 00:59:31
2022-07-21 15:28:33 [INFO]	[TRAIN] epoch: 2, iter: 5980/20000, loss: 0.4935, lr: 0.007264, batch_cost: 0.2400, reader_cost: 0.11756, ips: 16.6644 samples/sec | ETA 00:56:05
2022-07-21 15:28:35 [INFO]	[TRAIN] epoch: 2, iter: 5990/20000, loss: 0.3472, lr: 0.007259, batch_cost: 0.2269, reader_cost: 0.10572, ips: 17.6270 samples/sec | ETA 00:52:59
2022-07-21 15:28:37 [INFO]	[TRAIN] epoch: 2, iter: 6000/20000, loss: 0.3631, lr: 0.007255, batch_cost: 0.2277, reader_cost: 0.10674, ips: 17.5633 samples/sec | ETA 00:53:08
2022-07-21 15:28:40 [INFO]	[TRAIN] epoch: 2, iter: 6010/20000, loss: 0.5204, lr: 0.007250, batch_cost: 0.2376, reader_cost: 0.11643, ips: 16.8355 samples/sec | ETA 00:55:23
2022-07-21 15:28:42 [INFO]	[TRAIN] epoch: 2, iter: 6020/20000, loss: 0.4191, lr: 0.007245, batch_cost: 0.2416, reader_cost: 0.11915, ips: 16.5573 samples/sec | ETA 00:56:17
2022-07-21 15:28:45 [INFO]	[TRAIN] epoch: 2, iter: 6030/20000, loss: 0.2933, lr: 0.007241, batch_cost: 0.2301, reader_cost: 0.10508, ips: 17.3858 samples/sec | ETA 00:53:34
2022-07-21 15:28:47 [INFO]	[TRAIN] epoch: 2, iter: 6040/20000, loss: 0.2813, lr: 0.007236, batch_cost: 0.2586, reader_cost: 0.13151, ips: 15.4658 samples/sec | ETA 01:00:10
2022-07-21 15:28:50 [INFO]	[TRAIN] epoch: 2, iter: 6050/20000, loss: 0.3064, lr: 0.007231, batch_cost: 0.3217, reader_cost: 0.15455, ips: 12.4344 samples/sec | ETA 01:14:47
2022-07-21 15:28:53 [INFO]	[TRAIN] epoch: 2, iter: 6060/20000, loss: 0.3696, lr: 0.007227, batch_cost: 0.2338, reader_cost: 0.11033, ips: 17.1095 samples/sec | ETA 00:54:19
2022-07-21 15:28:55 [INFO]	[TRAIN] epoch: 2, iter: 6070/20000, loss: 0.3570, lr: 0.007222, batch_cost: 0.2453, reader_cost: 0.12155, ips: 16.3089 samples/sec | ETA 00:56:56
2022-07-21 15:28:57 [INFO]	[TRAIN] epoch: 2, iter: 6080/20000, loss: 0.4339, lr: 0.007217, batch_cost: 0.2289, reader_cost: 0.10667, ips: 17.4759 samples/sec | ETA 00:53:06
2022-07-21 15:29:00 [INFO]	[TRAIN] epoch: 2, iter: 6090/20000, loss: 0.3575, lr: 0.007213, batch_cost: 0.2353, reader_cost: 0.11189, ips: 16.9995 samples/sec | ETA 00:54:33
2022-07-21 15:29:02 [INFO]	[TRAIN] epoch: 2, iter: 6100/20000, loss: 0.3377, lr: 0.007208, batch_cost: 0.2388, reader_cost: 0.11679, ips: 16.7493 samples/sec | ETA 00:55:19
2022-07-21 15:29:04 [INFO]	[TRAIN] epoch: 2, iter: 6110/20000, loss: 0.3916, lr: 0.007203, batch_cost: 0.2300, reader_cost: 0.10673, ips: 17.3877 samples/sec | ETA 00:53:15
2022-07-21 15:29:07 [INFO]	[TRAIN] epoch: 2, iter: 6120/20000, loss: 0.4877, lr: 0.007199, batch_cost: 0.2296, reader_cost: 0.10230, ips: 17.4253 samples/sec | ETA 00:53:06
2022-07-21 15:29:09 [INFO]	[TRAIN] epoch: 2, iter: 6130/20000, loss: 0.3088, lr: 0.007194, batch_cost: 0.2528, reader_cost: 0.12871, ips: 15.8211 samples/sec | ETA 00:58:26
2022-07-21 15:29:12 [INFO]	[TRAIN] epoch: 2, iter: 6140/20000, loss: 0.4555, lr: 0.007189, batch_cost: 0.2384, reader_cost: 0.11575, ips: 16.7792 samples/sec | ETA 00:55:04
2022-07-21 15:29:14 [INFO]	[TRAIN] epoch: 2, iter: 6150/20000, loss: 0.3978, lr: 0.007185, batch_cost: 0.2203, reader_cost: 0.09742, ips: 18.1553 samples/sec | ETA 00:50:51
2022-07-21 15:29:16 [INFO]	[TRAIN] epoch: 2, iter: 6160/20000, loss: 0.3870, lr: 0.007180, batch_cost: 0.2528, reader_cost: 0.12292, ips: 15.8249 samples/sec | ETA 00:58:18
2022-07-21 15:29:20 [INFO]	[TRAIN] epoch: 2, iter: 6170/20000, loss: 0.3136, lr: 0.007175, batch_cost: 0.3504, reader_cost: 0.18132, ips: 11.4162 samples/sec | ETA 01:20:45
2022-07-21 15:29:23 [INFO]	[TRAIN] epoch: 2, iter: 6180/20000, loss: 0.4404, lr: 0.007171, batch_cost: 0.3246, reader_cost: 0.16046, ips: 12.3222 samples/sec | ETA 01:14:46
2022-07-21 15:29:25 [INFO]	[TRAIN] epoch: 2, iter: 6190/20000, loss: 0.3857, lr: 0.007166, batch_cost: 0.2284, reader_cost: 0.10501, ips: 17.5103 samples/sec | ETA 00:52:34
2022-07-21 15:29:28 [INFO]	[TRAIN] epoch: 2, iter: 6200/20000, loss: 0.4218, lr: 0.007161, batch_cost: 0.2225, reader_cost: 0.09900, ips: 17.9793 samples/sec | ETA 00:51:10
2022-07-21 15:29:30 [INFO]	[TRAIN] epoch: 2, iter: 6210/20000, loss: 0.5062, lr: 0.007157, batch_cost: 0.2191, reader_cost: 0.09666, ips: 18.2534 samples/sec | ETA 00:50:21
2022-07-21 15:29:32 [INFO]	[TRAIN] epoch: 2, iter: 6220/20000, loss: 0.4042, lr: 0.007152, batch_cost: 0.2349, reader_cost: 0.11141, ips: 17.0302 samples/sec | ETA 00:53:56
2022-07-21 15:29:35 [INFO]	[TRAIN] epoch: 2, iter: 6230/20000, loss: 0.3566, lr: 0.007147, batch_cost: 0.2344, reader_cost: 0.10760, ips: 17.0631 samples/sec | ETA 00:53:48
2022-07-21 15:29:37 [INFO]	[TRAIN] epoch: 2, iter: 6240/20000, loss: 0.3222, lr: 0.007143, batch_cost: 0.2779, reader_cost: 0.15093, ips: 14.3943 samples/sec | ETA 01:03:43
2022-07-21 15:29:40 [INFO]	[TRAIN] epoch: 2, iter: 6250/20000, loss: 0.6252, lr: 0.007138, batch_cost: 0.2204, reader_cost: 0.09867, ips: 18.1477 samples/sec | ETA 00:50:30
2022-07-21 15:29:42 [INFO]	[TRAIN] epoch: 2, iter: 6260/20000, loss: 0.4654, lr: 0.007133, batch_cost: 0.2322, reader_cost: 0.10986, ips: 17.2274 samples/sec | ETA 00:53:10
2022-07-21 15:29:44 [INFO]	[TRAIN] epoch: 2, iter: 6270/20000, loss: 0.3417, lr: 0.007129, batch_cost: 0.2425, reader_cost: 0.11935, ips: 16.4967 samples/sec | ETA 00:55:29
2022-07-21 15:29:47 [INFO]	[TRAIN] epoch: 2, iter: 6280/20000, loss: 0.4055, lr: 0.007124, batch_cost: 0.2401, reader_cost: 0.11454, ips: 16.6604 samples/sec | ETA 00:54:54
2022-07-21 15:29:49 [INFO]	[TRAIN] epoch: 2, iter: 6290/20000, loss: 0.2659, lr: 0.007119, batch_cost: 0.2366, reader_cost: 0.11102, ips: 16.9042 samples/sec | ETA 00:54:04
2022-07-21 15:29:52 [INFO]	[TRAIN] epoch: 2, iter: 6300/20000, loss: 0.3332, lr: 0.007115, batch_cost: 0.2472, reader_cost: 0.10840, ips: 16.1789 samples/sec | ETA 00:56:27
2022-07-21 15:29:55 [INFO]	[TRAIN] epoch: 2, iter: 6310/20000, loss: 0.2858, lr: 0.007110, batch_cost: 0.3016, reader_cost: 0.14486, ips: 13.2616 samples/sec | ETA 01:08:49
2022-07-21 15:29:57 [INFO]	[TRAIN] epoch: 2, iter: 6320/20000, loss: 0.3522, lr: 0.007105, batch_cost: 0.2498, reader_cost: 0.12548, ips: 16.0159 samples/sec | ETA 00:56:56
2022-07-21 15:29:59 [INFO]	[TRAIN] epoch: 2, iter: 6330/20000, loss: 0.3278, lr: 0.007101, batch_cost: 0.2126, reader_cost: 0.09122, ips: 18.8142 samples/sec | ETA 00:48:26
2022-07-21 15:30:01 [INFO]	[TRAIN] epoch: 2, iter: 6340/20000, loss: 0.5020, lr: 0.007096, batch_cost: 0.2230, reader_cost: 0.09922, ips: 17.9361 samples/sec | ETA 00:50:46
2022-07-21 15:30:04 [INFO]	[TRAIN] epoch: 2, iter: 6350/20000, loss: 0.3229, lr: 0.007091, batch_cost: 0.2230, reader_cost: 0.10007, ips: 17.9377 samples/sec | ETA 00:50:43
2022-07-21 15:30:06 [INFO]	[TRAIN] epoch: 2, iter: 6360/20000, loss: 0.4023, lr: 0.007087, batch_cost: 0.2410, reader_cost: 0.11804, ips: 16.6000 samples/sec | ETA 00:54:46
2022-07-21 15:30:08 [INFO]	[TRAIN] epoch: 2, iter: 6370/20000, loss: 0.3320, lr: 0.007082, batch_cost: 0.2278, reader_cost: 0.10464, ips: 17.5570 samples/sec | ETA 00:51:45
2022-07-21 15:30:11 [INFO]	[TRAIN] epoch: 2, iter: 6380/20000, loss: 0.4018, lr: 0.007077, batch_cost: 0.2465, reader_cost: 0.12232, ips: 16.2254 samples/sec | ETA 00:55:57
2022-07-21 15:30:13 [INFO]	[TRAIN] epoch: 2, iter: 6390/20000, loss: 0.3027, lr: 0.007073, batch_cost: 0.2460, reader_cost: 0.12364, ips: 16.2610 samples/sec | ETA 00:55:47
2022-07-21 15:30:15 [INFO]	[TRAIN] epoch: 2, iter: 6400/20000, loss: 0.4074, lr: 0.007068, batch_cost: 0.2258, reader_cost: 0.10257, ips: 17.7144 samples/sec | ETA 00:51:10
2022-07-21 15:30:18 [INFO]	[TRAIN] epoch: 2, iter: 6410/20000, loss: 0.2781, lr: 0.007063, batch_cost: 0.2333, reader_cost: 0.10916, ips: 17.1432 samples/sec | ETA 00:52:50
2022-07-21 15:30:20 [INFO]	[TRAIN] epoch: 2, iter: 6420/20000, loss: 0.3350, lr: 0.007058, batch_cost: 0.2306, reader_cost: 0.10815, ips: 17.3487 samples/sec | ETA 00:52:11
2022-07-21 15:30:22 [INFO]	[TRAIN] epoch: 2, iter: 6430/20000, loss: 0.3672, lr: 0.007054, batch_cost: 0.2367, reader_cost: 0.11009, ips: 16.8966 samples/sec | ETA 00:53:32
2022-07-21 15:30:26 [INFO]	[TRAIN] epoch: 2, iter: 6440/20000, loss: 0.3798, lr: 0.007049, batch_cost: 0.3796, reader_cost: 0.16630, ips: 10.5381 samples/sec | ETA 01:25:47
2022-07-21 15:30:29 [INFO]	[TRAIN] epoch: 2, iter: 6450/20000, loss: 0.3803, lr: 0.007044, batch_cost: 0.3094, reader_cost: 0.15834, ips: 12.9279 samples/sec | ETA 01:09:52
2022-07-21 15:30:32 [INFO]	[TRAIN] epoch: 2, iter: 6460/20000, loss: 0.3558, lr: 0.007040, batch_cost: 0.2249, reader_cost: 0.10135, ips: 17.7841 samples/sec | ETA 00:50:45
2022-07-21 15:30:34 [INFO]	[TRAIN] epoch: 2, iter: 6470/20000, loss: 0.5293, lr: 0.007035, batch_cost: 0.2245, reader_cost: 0.10168, ips: 17.8178 samples/sec | ETA 00:50:37
2022-07-21 15:30:36 [INFO]	[TRAIN] epoch: 2, iter: 6480/20000, loss: 0.4786, lr: 0.007030, batch_cost: 0.2322, reader_cost: 0.10624, ips: 17.2272 samples/sec | ETA 00:52:19
2022-07-21 15:30:39 [INFO]	[TRAIN] epoch: 2, iter: 6490/20000, loss: 0.3329, lr: 0.007026, batch_cost: 0.2670, reader_cost: 0.13913, ips: 14.9824 samples/sec | ETA 01:00:06
2022-07-21 15:30:41 [INFO]	[TRAIN] epoch: 2, iter: 6500/20000, loss: 0.5508, lr: 0.007021, batch_cost: 0.2443, reader_cost: 0.11347, ips: 16.3752 samples/sec | ETA 00:54:57
2022-07-21 15:30:44 [INFO]	[TRAIN] epoch: 2, iter: 6510/20000, loss: 0.4365, lr: 0.007016, batch_cost: 0.2287, reader_cost: 0.10330, ips: 17.4884 samples/sec | ETA 00:51:25
2022-07-21 15:30:46 [INFO]	[TRAIN] epoch: 2, iter: 6520/20000, loss: 0.3806, lr: 0.007012, batch_cost: 0.2295, reader_cost: 0.10389, ips: 17.4276 samples/sec | ETA 00:51:33
2022-07-21 15:30:48 [INFO]	[TRAIN] epoch: 2, iter: 6530/20000, loss: 0.4894, lr: 0.007007, batch_cost: 0.2373, reader_cost: 0.11379, ips: 16.8541 samples/sec | ETA 00:53:16
2022-07-21 15:30:51 [INFO]	[TRAIN] epoch: 2, iter: 6540/20000, loss: 0.3412, lr: 0.007002, batch_cost: 0.2369, reader_cost: 0.11469, ips: 16.8848 samples/sec | ETA 00:53:08
2022-07-21 15:30:53 [INFO]	[TRAIN] epoch: 2, iter: 6550/20000, loss: 0.2933, lr: 0.006998, batch_cost: 0.2646, reader_cost: 0.12953, ips: 15.1166 samples/sec | ETA 00:59:19
2022-07-21 15:30:56 [INFO]	[TRAIN] epoch: 2, iter: 6560/20000, loss: 0.3020, lr: 0.006993, batch_cost: 0.2739, reader_cost: 0.13773, ips: 14.6032 samples/sec | ETA 01:01:21
2022-07-21 15:30:59 [INFO]	[TRAIN] epoch: 2, iter: 6570/20000, loss: 0.4782, lr: 0.006988, batch_cost: 0.2658, reader_cost: 0.13235, ips: 15.0464 samples/sec | ETA 00:59:30
2022-07-21 15:31:01 [INFO]	[TRAIN] epoch: 2, iter: 6580/20000, loss: 0.3459, lr: 0.006984, batch_cost: 0.2291, reader_cost: 0.10386, ips: 17.4576 samples/sec | ETA 00:51:14
2022-07-21 15:31:03 [INFO]	[TRAIN] epoch: 2, iter: 6590/20000, loss: 0.3739, lr: 0.006979, batch_cost: 0.2280, reader_cost: 0.10387, ips: 17.5423 samples/sec | ETA 00:50:57
2022-07-21 15:31:06 [INFO]	[TRAIN] epoch: 2, iter: 6600/20000, loss: 0.3185, lr: 0.006974, batch_cost: 0.2263, reader_cost: 0.10475, ips: 17.6731 samples/sec | ETA 00:50:32
2022-07-21 15:31:08 [INFO]	[TRAIN] epoch: 2, iter: 6610/20000, loss: 0.5080, lr: 0.006970, batch_cost: 0.2301, reader_cost: 0.10776, ips: 17.3827 samples/sec | ETA 00:51:21
2022-07-21 15:31:10 [INFO]	[TRAIN] epoch: 2, iter: 6620/20000, loss: 0.4798, lr: 0.006965, batch_cost: 0.2330, reader_cost: 0.11024, ips: 17.1642 samples/sec | ETA 00:51:58
2022-07-21 15:31:13 [INFO]	[TRAIN] epoch: 2, iter: 6630/20000, loss: 0.3907, lr: 0.006960, batch_cost: 0.2349, reader_cost: 0.11189, ips: 17.0318 samples/sec | ETA 00:52:20
2022-07-21 15:31:15 [INFO]	[TRAIN] epoch: 2, iter: 6640/20000, loss: 0.3904, lr: 0.006955, batch_cost: 0.2374, reader_cost: 0.11479, ips: 16.8476 samples/sec | ETA 00:52:51
2022-07-21 15:31:17 [INFO]	[TRAIN] epoch: 2, iter: 6650/20000, loss: 0.4754, lr: 0.006951, batch_cost: 0.2165, reader_cost: 0.09473, ips: 18.4784 samples/sec | ETA 00:48:09
2022-07-21 15:31:19 [INFO]	[TRAIN] epoch: 2, iter: 6660/20000, loss: 0.3973, lr: 0.006946, batch_cost: 0.2231, reader_cost: 0.09956, ips: 17.9285 samples/sec | ETA 00:49:36
2022-07-21 15:31:22 [INFO]	[TRAIN] epoch: 2, iter: 6670/20000, loss: 0.3188, lr: 0.006941, batch_cost: 0.2422, reader_cost: 0.11782, ips: 16.5133 samples/sec | ETA 00:53:48
2022-07-21 15:31:24 [INFO]	[TRAIN] epoch: 2, iter: 6680/20000, loss: 0.5073, lr: 0.006937, batch_cost: 0.2397, reader_cost: 0.11378, ips: 16.6856 samples/sec | ETA 00:53:13
2022-07-21 15:31:26 [INFO]	[TRAIN] epoch: 2, iter: 6690/20000, loss: 0.3654, lr: 0.006932, batch_cost: 0.2301, reader_cost: 0.10683, ips: 17.3831 samples/sec | ETA 00:51:02
2022-07-21 15:31:29 [INFO]	[TRAIN] epoch: 2, iter: 6700/20000, loss: 0.4763, lr: 0.006927, batch_cost: 0.2476, reader_cost: 0.11176, ips: 16.1574 samples/sec | ETA 00:54:52
2022-07-21 15:31:32 [INFO]	[TRAIN] epoch: 2, iter: 6710/20000, loss: 0.3630, lr: 0.006923, batch_cost: 0.3222, reader_cost: 0.15137, ips: 12.4137 samples/sec | ETA 01:11:22
2022-07-21 15:31:36 [INFO]	[TRAIN] epoch: 2, iter: 6720/20000, loss: 0.3713, lr: 0.006918, batch_cost: 0.3409, reader_cost: 0.17863, ips: 11.7336 samples/sec | ETA 01:15:27
2022-07-21 15:31:38 [INFO]	[TRAIN] epoch: 2, iter: 6730/20000, loss: 0.4624, lr: 0.006913, batch_cost: 0.2346, reader_cost: 0.11056, ips: 17.0489 samples/sec | ETA 00:51:53
2022-07-21 15:31:40 [INFO]	[TRAIN] epoch: 2, iter: 6740/20000, loss: 0.3269, lr: 0.006909, batch_cost: 0.2424, reader_cost: 0.12124, ips: 16.5028 samples/sec | ETA 00:53:33
2022-07-21 15:31:43 [INFO]	[TRAIN] epoch: 2, iter: 6750/20000, loss: 0.4448, lr: 0.006904, batch_cost: 0.2418, reader_cost: 0.11775, ips: 16.5428 samples/sec | ETA 00:53:23
2022-07-21 15:31:45 [INFO]	[TRAIN] epoch: 2, iter: 6760/20000, loss: 0.3073, lr: 0.006899, batch_cost: 0.2401, reader_cost: 0.11143, ips: 16.6620 samples/sec | ETA 00:52:58
2022-07-21 15:31:47 [INFO]	[TRAIN] epoch: 2, iter: 6770/20000, loss: 0.3713, lr: 0.006895, batch_cost: 0.2375, reader_cost: 0.10728, ips: 16.8456 samples/sec | ETA 00:52:21
2022-07-21 15:31:50 [INFO]	[TRAIN] epoch: 2, iter: 6780/20000, loss: 0.3410, lr: 0.006890, batch_cost: 0.2395, reader_cost: 0.11575, ips: 16.7001 samples/sec | ETA 00:52:46
2022-07-21 15:31:52 [INFO]	[TRAIN] epoch: 2, iter: 6790/20000, loss: 0.3680, lr: 0.006885, batch_cost: 0.2391, reader_cost: 0.11785, ips: 16.7317 samples/sec | ETA 00:52:38
2022-07-21 15:31:55 [INFO]	[TRAIN] epoch: 2, iter: 6800/20000, loss: 0.5045, lr: 0.006880, batch_cost: 0.2253, reader_cost: 0.10000, ips: 17.7566 samples/sec | ETA 00:49:33
2022-07-21 15:31:57 [INFO]	[TRAIN] epoch: 2, iter: 6810/20000, loss: 0.3553, lr: 0.006876, batch_cost: 0.2863, reader_cost: 0.13632, ips: 13.9714 samples/sec | ETA 01:02:56
2022-07-21 15:32:00 [INFO]	[TRAIN] epoch: 2, iter: 6820/20000, loss: 0.4394, lr: 0.006871, batch_cost: 0.2254, reader_cost: 0.09981, ips: 17.7441 samples/sec | ETA 00:49:31
2022-07-21 15:32:02 [INFO]	[TRAIN] epoch: 2, iter: 6830/20000, loss: 0.4109, lr: 0.006866, batch_cost: 0.2639, reader_cost: 0.11929, ips: 15.1562 samples/sec | ETA 00:57:55
2022-07-21 15:32:05 [INFO]	[TRAIN] epoch: 2, iter: 6840/20000, loss: 0.4091, lr: 0.006862, batch_cost: 0.2323, reader_cost: 0.11129, ips: 17.2187 samples/sec | ETA 00:50:57
2022-07-21 15:32:07 [INFO]	[TRAIN] epoch: 2, iter: 6850/20000, loss: 0.3348, lr: 0.006857, batch_cost: 0.2203, reader_cost: 0.10030, ips: 18.1532 samples/sec | ETA 00:48:17
2022-07-21 15:32:09 [INFO]	[TRAIN] epoch: 2, iter: 6860/20000, loss: 0.4871, lr: 0.006852, batch_cost: 0.2327, reader_cost: 0.10903, ips: 17.1882 samples/sec | ETA 00:50:57
2022-07-21 15:32:12 [INFO]	[TRAIN] epoch: 2, iter: 6870/20000, loss: 0.3589, lr: 0.006848, batch_cost: 0.2375, reader_cost: 0.11095, ips: 16.8451 samples/sec | ETA 00:51:57
2022-07-21 15:32:14 [INFO]	[TRAIN] epoch: 2, iter: 6880/20000, loss: 0.3127, lr: 0.006843, batch_cost: 0.2106, reader_cost: 0.08431, ips: 18.9931 samples/sec | ETA 00:46:03
2022-07-21 15:32:16 [INFO]	[TRAIN] epoch: 2, iter: 6890/20000, loss: 0.4708, lr: 0.006838, batch_cost: 0.2371, reader_cost: 0.10906, ips: 16.8734 samples/sec | ETA 00:51:47
2022-07-21 15:32:18 [INFO]	[TRAIN] epoch: 2, iter: 6900/20000, loss: 0.3590, lr: 0.006834, batch_cost: 0.2223, reader_cost: 0.09993, ips: 17.9914 samples/sec | ETA 00:48:32
2022-07-21 15:32:21 [INFO]	[TRAIN] epoch: 2, iter: 6910/20000, loss: 0.3546, lr: 0.006829, batch_cost: 0.2304, reader_cost: 0.10793, ips: 17.3615 samples/sec | ETA 00:50:15
2022-07-21 15:32:23 [INFO]	[TRAIN] epoch: 2, iter: 6920/20000, loss: 0.4434, lr: 0.006824, batch_cost: 0.2357, reader_cost: 0.10859, ips: 16.9676 samples/sec | ETA 00:51:23
2022-07-21 15:32:25 [INFO]	[TRAIN] epoch: 2, iter: 6930/20000, loss: 0.3885, lr: 0.006819, batch_cost: 0.2202, reader_cost: 0.09591, ips: 18.1615 samples/sec | ETA 00:47:58
2022-07-21 15:32:28 [INFO]	[TRAIN] epoch: 2, iter: 6940/20000, loss: 0.2846, lr: 0.006815, batch_cost: 0.2415, reader_cost: 0.11246, ips: 16.5612 samples/sec | ETA 00:52:34
2022-07-21 15:32:30 [INFO]	[TRAIN] epoch: 2, iter: 6950/20000, loss: 0.4283, lr: 0.006810, batch_cost: 0.2324, reader_cost: 0.10792, ips: 17.2152 samples/sec | ETA 00:50:32
2022-07-21 15:32:32 [INFO]	[TRAIN] epoch: 2, iter: 6960/20000, loss: 0.4757, lr: 0.006805, batch_cost: 0.2465, reader_cost: 0.11929, ips: 16.2254 samples/sec | ETA 00:53:34
2022-07-21 15:32:35 [INFO]	[TRAIN] epoch: 2, iter: 6970/20000, loss: 0.3434, lr: 0.006801, batch_cost: 0.2578, reader_cost: 0.11153, ips: 15.5159 samples/sec | ETA 00:55:59
2022-07-21 15:32:37 [INFO]	[TRAIN] epoch: 2, iter: 6980/20000, loss: 0.4000, lr: 0.006796, batch_cost: 0.2313, reader_cost: 0.10361, ips: 17.2901 samples/sec | ETA 00:50:12
2022-07-21 15:32:41 [INFO]	[TRAIN] epoch: 2, iter: 6990/20000, loss: 0.3531, lr: 0.006791, batch_cost: 0.3619, reader_cost: 0.17299, ips: 11.0534 samples/sec | ETA 01:18:28
2022-07-21 15:32:43 [INFO]	[TRAIN] epoch: 2, iter: 7000/20000, loss: 0.3181, lr: 0.006787, batch_cost: 0.2608, reader_cost: 0.12082, ips: 15.3372 samples/sec | ETA 00:56:30
2022-07-21 15:32:46 [INFO]	[TRAIN] epoch: 2, iter: 7010/20000, loss: 0.4069, lr: 0.006782, batch_cost: 0.2448, reader_cost: 0.11956, ips: 16.3398 samples/sec | ETA 00:52:59
2022-07-21 15:32:48 [INFO]	[TRAIN] epoch: 2, iter: 7020/20000, loss: 0.4162, lr: 0.006777, batch_cost: 0.2301, reader_cost: 0.10395, ips: 17.3832 samples/sec | ETA 00:49:46
2022-07-21 15:32:50 [INFO]	[TRAIN] epoch: 2, iter: 7030/20000, loss: 0.4281, lr: 0.006773, batch_cost: 0.2239, reader_cost: 0.09777, ips: 17.8620 samples/sec | ETA 00:48:24
2022-07-21 15:32:53 [INFO]	[TRAIN] epoch: 2, iter: 7040/20000, loss: 0.3480, lr: 0.006768, batch_cost: 0.2610, reader_cost: 0.13134, ips: 15.3233 samples/sec | ETA 00:56:23
2022-07-21 15:32:55 [INFO]	[TRAIN] epoch: 2, iter: 7050/20000, loss: 0.4102, lr: 0.006763, batch_cost: 0.2447, reader_cost: 0.12001, ips: 16.3442 samples/sec | ETA 00:52:49
2022-07-21 15:32:58 [INFO]	[TRAIN] epoch: 2, iter: 7060/20000, loss: 0.4236, lr: 0.006758, batch_cost: 0.2739, reader_cost: 0.13169, ips: 14.6030 samples/sec | ETA 00:59:04
2022-07-21 15:33:01 [INFO]	[TRAIN] epoch: 2, iter: 7070/20000, loss: 0.4329, lr: 0.006754, batch_cost: 0.2470, reader_cost: 0.11624, ips: 16.1927 samples/sec | ETA 00:53:14
2022-07-21 15:33:03 [INFO]	[TRAIN] epoch: 2, iter: 7080/20000, loss: 0.3161, lr: 0.006749, batch_cost: 0.2269, reader_cost: 0.10461, ips: 17.6300 samples/sec | ETA 00:48:51
2022-07-21 15:33:06 [INFO]	[TRAIN] epoch: 2, iter: 7090/20000, loss: 0.2810, lr: 0.006744, batch_cost: 0.2844, reader_cost: 0.14339, ips: 14.0633 samples/sec | ETA 01:01:11
2022-07-21 15:33:08 [INFO]	[TRAIN] epoch: 2, iter: 7100/20000, loss: 0.3926, lr: 0.006740, batch_cost: 0.2178, reader_cost: 0.09589, ips: 18.3622 samples/sec | ETA 00:46:50
2022-07-21 15:33:11 [INFO]	[TRAIN] epoch: 2, iter: 7110/20000, loss: 0.4195, lr: 0.006735, batch_cost: 0.2533, reader_cost: 0.12724, ips: 15.7908 samples/sec | ETA 00:54:25
2022-07-21 15:33:13 [INFO]	[TRAIN] epoch: 2, iter: 7120/20000, loss: 0.2977, lr: 0.006730, batch_cost: 0.2414, reader_cost: 0.11868, ips: 16.5669 samples/sec | ETA 00:51:49
2022-07-21 15:33:15 [INFO]	[TRAIN] epoch: 2, iter: 7130/20000, loss: 0.3706, lr: 0.006725, batch_cost: 0.2425, reader_cost: 0.11565, ips: 16.4922 samples/sec | ETA 00:52:01
2022-07-21 15:33:18 [INFO]	[TRAIN] epoch: 2, iter: 7140/20000, loss: 0.3393, lr: 0.006721, batch_cost: 0.2215, reader_cost: 0.09762, ips: 18.0617 samples/sec | ETA 00:47:28
2022-07-21 15:33:20 [INFO]	[TRAIN] epoch: 2, iter: 7150/20000, loss: 0.3397, lr: 0.006716, batch_cost: 0.2352, reader_cost: 0.10913, ips: 17.0082 samples/sec | ETA 00:50:22
2022-07-21 15:33:22 [INFO]	[TRAIN] epoch: 2, iter: 7160/20000, loss: 0.3851, lr: 0.006711, batch_cost: 0.2252, reader_cost: 0.10122, ips: 17.7590 samples/sec | ETA 00:48:12
2022-07-21 15:33:24 [INFO]	[TRAIN] epoch: 2, iter: 7170/20000, loss: 0.2772, lr: 0.006707, batch_cost: 0.2318, reader_cost: 0.10843, ips: 17.2552 samples/sec | ETA 00:49:34
2022-07-21 15:33:27 [INFO]	[TRAIN] epoch: 2, iter: 7180/20000, loss: 0.4299, lr: 0.006702, batch_cost: 0.2216, reader_cost: 0.09848, ips: 18.0497 samples/sec | ETA 00:47:21
2022-07-21 15:33:29 [INFO]	[TRAIN] epoch: 2, iter: 7190/20000, loss: 0.3898, lr: 0.006697, batch_cost: 0.2397, reader_cost: 0.11387, ips: 16.6887 samples/sec | ETA 00:51:10
2022-07-21 15:33:31 [INFO]	[TRAIN] epoch: 2, iter: 7200/20000, loss: 0.3092, lr: 0.006693, batch_cost: 0.2285, reader_cost: 0.10607, ips: 17.5063 samples/sec | ETA 00:48:44
2022-07-21 15:33:34 [INFO]	[TRAIN] epoch: 2, iter: 7210/20000, loss: 0.3152, lr: 0.006688, batch_cost: 0.2418, reader_cost: 0.11595, ips: 16.5425 samples/sec | ETA 00:51:32
2022-07-21 15:33:36 [INFO]	[TRAIN] epoch: 2, iter: 7220/20000, loss: 0.3713, lr: 0.006683, batch_cost: 0.2498, reader_cost: 0.11376, ips: 16.0160 samples/sec | ETA 00:53:11
2022-07-21 15:33:39 [INFO]	[TRAIN] epoch: 2, iter: 7230/20000, loss: 0.5055, lr: 0.006678, batch_cost: 0.2535, reader_cost: 0.12327, ips: 15.7762 samples/sec | ETA 00:53:57
2022-07-21 15:33:41 [INFO]	[TRAIN] epoch: 2, iter: 7240/20000, loss: 0.4321, lr: 0.006674, batch_cost: 0.2336, reader_cost: 0.10987, ips: 17.1228 samples/sec | ETA 00:49:40
2022-07-21 15:33:44 [INFO]	[TRAIN] epoch: 2, iter: 7250/20000, loss: 0.3176, lr: 0.006669, batch_cost: 0.2408, reader_cost: 0.10967, ips: 16.6090 samples/sec | ETA 00:51:10
2022-07-21 15:33:47 [INFO]	[TRAIN] epoch: 2, iter: 7260/20000, loss: 0.3554, lr: 0.006664, batch_cost: 0.3600, reader_cost: 0.17208, ips: 11.1109 samples/sec | ETA 01:16:26
2022-07-21 15:33:50 [INFO]	[TRAIN] epoch: 2, iter: 7270/20000, loss: 0.3413, lr: 0.006660, batch_cost: 0.2777, reader_cost: 0.12316, ips: 14.4033 samples/sec | ETA 00:58:55
2022-07-21 15:33:52 [INFO]	[TRAIN] epoch: 2, iter: 7280/20000, loss: 0.3042, lr: 0.006655, batch_cost: 0.2337, reader_cost: 0.11097, ips: 17.1188 samples/sec | ETA 00:49:32
2022-07-21 15:33:55 [INFO]	[TRAIN] epoch: 2, iter: 7290/20000, loss: 0.3453, lr: 0.006650, batch_cost: 0.2533, reader_cost: 0.12670, ips: 15.7913 samples/sec | ETA 00:53:39
2022-07-21 15:33:58 [INFO]	[TRAIN] epoch: 2, iter: 7300/20000, loss: 0.3940, lr: 0.006645, batch_cost: 0.2672, reader_cost: 0.10734, ips: 14.9722 samples/sec | ETA 00:56:32
2022-07-21 15:34:00 [INFO]	[TRAIN] epoch: 2, iter: 7310/20000, loss: 0.4133, lr: 0.006641, batch_cost: 0.2700, reader_cost: 0.12531, ips: 14.8164 samples/sec | ETA 00:57:05
2022-07-21 15:34:03 [INFO]	[TRAIN] epoch: 2, iter: 7320/20000, loss: 0.3483, lr: 0.006636, batch_cost: 0.2414, reader_cost: 0.11520, ips: 16.5671 samples/sec | ETA 00:51:01
2022-07-21 15:34:05 [INFO]	[TRAIN] epoch: 2, iter: 7330/20000, loss: 0.3672, lr: 0.006631, batch_cost: 0.2220, reader_cost: 0.09866, ips: 18.0156 samples/sec | ETA 00:46:53
2022-07-21 15:34:07 [INFO]	[TRAIN] epoch: 2, iter: 7340/20000, loss: 0.3119, lr: 0.006627, batch_cost: 0.2347, reader_cost: 0.10806, ips: 17.0433 samples/sec | ETA 00:49:31
2022-07-21 15:34:10 [INFO]	[TRAIN] epoch: 2, iter: 7350/20000, loss: 0.3093, lr: 0.006622, batch_cost: 0.2608, reader_cost: 0.12743, ips: 15.3397 samples/sec | ETA 00:54:58
2022-07-21 15:34:12 [INFO]	[TRAIN] epoch: 2, iter: 7360/20000, loss: 0.3871, lr: 0.006617, batch_cost: 0.2405, reader_cost: 0.11722, ips: 16.6308 samples/sec | ETA 00:50:40
2022-07-21 15:34:14 [INFO]	[TRAIN] epoch: 2, iter: 7370/20000, loss: 0.4515, lr: 0.006613, batch_cost: 0.2287, reader_cost: 0.10703, ips: 17.4892 samples/sec | ETA 00:48:08
2022-07-21 15:34:17 [INFO]	[TRAIN] epoch: 2, iter: 7380/20000, loss: 0.3057, lr: 0.006608, batch_cost: 0.2244, reader_cost: 0.10009, ips: 17.8228 samples/sec | ETA 00:47:12
2022-07-21 15:34:19 [INFO]	[TRAIN] epoch: 2, iter: 7390/20000, loss: 0.3382, lr: 0.006603, batch_cost: 0.2351, reader_cost: 0.10947, ips: 17.0145 samples/sec | ETA 00:49:24
2022-07-21 15:34:21 [INFO]	[TRAIN] epoch: 2, iter: 7400/20000, loss: 0.2837, lr: 0.006598, batch_cost: 0.2228, reader_cost: 0.09866, ips: 17.9522 samples/sec | ETA 00:46:47
2022-07-21 15:34:24 [INFO]	[TRAIN] epoch: 2, iter: 7410/20000, loss: 0.3093, lr: 0.006594, batch_cost: 0.2476, reader_cost: 0.12520, ips: 16.1520 samples/sec | ETA 00:51:57
2022-07-21 15:34:26 [INFO]	[TRAIN] epoch: 2, iter: 7420/20000, loss: 0.3254, lr: 0.006589, batch_cost: 0.2503, reader_cost: 0.12771, ips: 15.9809 samples/sec | ETA 00:52:28
2022-07-21 15:34:29 [INFO]	[TRAIN] epoch: 2, iter: 7430/20000, loss: 0.3488, lr: 0.006584, batch_cost: 0.2514, reader_cost: 0.12685, ips: 15.9080 samples/sec | ETA 00:52:40
2022-07-21 15:34:31 [INFO]	[TRAIN] epoch: 2, iter: 7440/20000, loss: 0.3824, lr: 0.006580, batch_cost: 0.2377, reader_cost: 0.11510, ips: 16.8279 samples/sec | ETA 00:49:45
2022-07-21 15:34:33 [INFO]	[TRAIN] epoch: 2, iter: 7450/20000, loss: 0.4773, lr: 0.006575, batch_cost: 0.2243, reader_cost: 0.09735, ips: 17.8353 samples/sec | ETA 00:46:54
2022-07-21 15:34:36 [INFO]	[TRAIN] epoch: 2, iter: 7460/20000, loss: 0.4035, lr: 0.006570, batch_cost: 0.2231, reader_cost: 0.09667, ips: 17.9265 samples/sec | ETA 00:46:38
2022-07-21 15:34:38 [INFO]	[TRAIN] epoch: 2, iter: 7470/20000, loss: 0.4438, lr: 0.006565, batch_cost: 0.2397, reader_cost: 0.11276, ips: 16.6844 samples/sec | ETA 00:50:04
2022-07-21 15:34:41 [INFO]	[TRAIN] epoch: 2, iter: 7480/20000, loss: 0.4695, lr: 0.006561, batch_cost: 0.2707, reader_cost: 0.12761, ips: 14.7776 samples/sec | ETA 00:56:28
2022-07-21 15:34:43 [INFO]	[TRAIN] epoch: 2, iter: 7490/20000, loss: 0.3455, lr: 0.006556, batch_cost: 0.2425, reader_cost: 0.11669, ips: 16.4946 samples/sec | ETA 00:50:33
2022-07-21 15:34:46 [INFO]	[TRAIN] epoch: 2, iter: 7500/20000, loss: 0.3670, lr: 0.006551, batch_cost: 0.2513, reader_cost: 0.12014, ips: 15.9160 samples/sec | ETA 00:52:21
2022-07-21 15:34:48 [INFO]	[TRAIN] epoch: 2, iter: 7510/20000, loss: 0.3403, lr: 0.006547, batch_cost: 0.2347, reader_cost: 0.10734, ips: 17.0463 samples/sec | ETA 00:48:50
2022-07-21 15:34:50 [INFO]	[TRAIN] epoch: 2, iter: 7520/20000, loss: 0.3827, lr: 0.006542, batch_cost: 0.2336, reader_cost: 0.10740, ips: 17.1268 samples/sec | ETA 00:48:34
2022-07-21 15:34:54 [INFO]	[TRAIN] epoch: 2, iter: 7530/20000, loss: 0.3406, lr: 0.006537, batch_cost: 0.3588, reader_cost: 0.17673, ips: 11.1481 samples/sec | ETA 01:14:34
2022-07-21 15:34:57 [INFO]	[TRAIN] epoch: 2, iter: 7540/20000, loss: 0.3298, lr: 0.006532, batch_cost: 0.2528, reader_cost: 0.10945, ips: 15.8209 samples/sec | ETA 00:52:30
2022-07-21 15:34:59 [INFO]	[TRAIN] epoch: 2, iter: 7550/20000, loss: 0.3142, lr: 0.006528, batch_cost: 0.2399, reader_cost: 0.11716, ips: 16.6760 samples/sec | ETA 00:49:46
2022-07-21 15:35:01 [INFO]	[TRAIN] epoch: 2, iter: 7560/20000, loss: 0.4109, lr: 0.006523, batch_cost: 0.2498, reader_cost: 0.11896, ips: 16.0115 samples/sec | ETA 00:51:47
2022-07-21 15:35:04 [INFO]	[TRAIN] epoch: 2, iter: 7570/20000, loss: 0.4276, lr: 0.006518, batch_cost: 0.2987, reader_cost: 0.16518, ips: 13.3921 samples/sec | ETA 01:01:52
2022-07-21 15:35:07 [INFO]	[TRAIN] epoch: 2, iter: 7580/20000, loss: 0.4390, lr: 0.006513, batch_cost: 0.2366, reader_cost: 0.11201, ips: 16.9075 samples/sec | ETA 00:48:58
2022-07-21 15:35:09 [INFO]	[TRAIN] epoch: 2, iter: 7590/20000, loss: 0.3332, lr: 0.006509, batch_cost: 0.2265, reader_cost: 0.10376, ips: 17.6634 samples/sec | ETA 00:46:50
2022-07-21 15:35:12 [INFO]	[TRAIN] epoch: 2, iter: 7600/20000, loss: 0.3464, lr: 0.006504, batch_cost: 0.2777, reader_cost: 0.13289, ips: 14.4023 samples/sec | ETA 00:57:23
2022-07-21 15:35:14 [INFO]	[TRAIN] epoch: 2, iter: 7610/20000, loss: 0.3708, lr: 0.006499, batch_cost: 0.2247, reader_cost: 0.09532, ips: 17.8030 samples/sec | ETA 00:46:23
2022-07-21 15:35:16 [INFO]	[TRAIN] epoch: 2, iter: 7620/20000, loss: 0.3830, lr: 0.006495, batch_cost: 0.2380, reader_cost: 0.11453, ips: 16.8087 samples/sec | ETA 00:49:06
2022-07-21 15:35:19 [INFO]	[TRAIN] epoch: 2, iter: 7630/20000, loss: 0.4397, lr: 0.006490, batch_cost: 0.2215, reader_cost: 0.09795, ips: 18.0593 samples/sec | ETA 00:45:39
2022-07-21 15:35:21 [INFO]	[TRAIN] epoch: 2, iter: 7640/20000, loss: 0.3209, lr: 0.006485, batch_cost: 0.2401, reader_cost: 0.11539, ips: 16.6577 samples/sec | ETA 00:49:27
2022-07-21 15:35:23 [INFO]	[TRAIN] epoch: 2, iter: 7650/20000, loss: 0.3499, lr: 0.006480, batch_cost: 0.2323, reader_cost: 0.11072, ips: 17.2181 samples/sec | ETA 00:47:49
2022-07-21 15:35:26 [INFO]	[TRAIN] epoch: 2, iter: 7660/20000, loss: 0.4862, lr: 0.006476, batch_cost: 0.2282, reader_cost: 0.10474, ips: 17.5320 samples/sec | ETA 00:46:55
2022-07-21 15:35:28 [INFO]	[TRAIN] epoch: 2, iter: 7670/20000, loss: 0.2943, lr: 0.006471, batch_cost: 0.2314, reader_cost: 0.10949, ips: 17.2882 samples/sec | ETA 00:47:32
2022-07-21 15:35:30 [INFO]	[TRAIN] epoch: 2, iter: 7680/20000, loss: 0.5478, lr: 0.006466, batch_cost: 0.2173, reader_cost: 0.09371, ips: 18.4044 samples/sec | ETA 00:44:37
2022-07-21 15:35:32 [INFO]	[TRAIN] epoch: 2, iter: 7690/20000, loss: 0.4967, lr: 0.006462, batch_cost: 0.2220, reader_cost: 0.09944, ips: 18.0159 samples/sec | ETA 00:45:33
2022-07-21 15:35:35 [INFO]	[TRAIN] epoch: 2, iter: 7700/20000, loss: 0.3755, lr: 0.006457, batch_cost: 0.2333, reader_cost: 0.10901, ips: 17.1437 samples/sec | ETA 00:47:49
2022-07-21 15:35:37 [INFO]	[TRAIN] epoch: 2, iter: 7710/20000, loss: 0.4112, lr: 0.006452, batch_cost: 0.2435, reader_cost: 0.12051, ips: 16.4247 samples/sec | ETA 00:49:53
2022-07-21 15:35:39 [INFO]	[TRAIN] epoch: 2, iter: 7720/20000, loss: 0.4762, lr: 0.006447, batch_cost: 0.2273, reader_cost: 0.09972, ips: 17.6010 samples/sec | ETA 00:46:30
2022-07-21 15:35:42 [INFO]	[TRAIN] epoch: 2, iter: 7730/20000, loss: 0.4150, lr: 0.006443, batch_cost: 0.2274, reader_cost: 0.10446, ips: 17.5882 samples/sec | ETA 00:46:30
2022-07-21 15:35:44 [INFO]	[TRAIN] epoch: 2, iter: 7740/20000, loss: 0.4047, lr: 0.006438, batch_cost: 0.2722, reader_cost: 0.12576, ips: 14.6930 samples/sec | ETA 00:55:37
2022-07-21 15:35:47 [INFO]	[TRAIN] epoch: 2, iter: 7750/20000, loss: 0.2710, lr: 0.006433, batch_cost: 0.2150, reader_cost: 0.08840, ips: 18.6030 samples/sec | ETA 00:43:53
2022-07-21 15:35:49 [INFO]	[TRAIN] epoch: 2, iter: 7760/20000, loss: 0.2710, lr: 0.006428, batch_cost: 0.2315, reader_cost: 0.10775, ips: 17.2759 samples/sec | ETA 00:47:14
2022-07-21 15:35:51 [INFO]	[TRAIN] epoch: 2, iter: 7770/20000, loss: 0.3553, lr: 0.006424, batch_cost: 0.2330, reader_cost: 0.11110, ips: 17.1694 samples/sec | ETA 00:47:29
2022-07-21 15:35:53 [INFO]	[TRAIN] epoch: 2, iter: 7780/20000, loss: 0.3030, lr: 0.006419, batch_cost: 0.2246, reader_cost: 0.10236, ips: 17.8115 samples/sec | ETA 00:45:44
2022-07-21 15:35:56 [INFO]	[TRAIN] epoch: 2, iter: 7790/20000, loss: 0.3578, lr: 0.006414, batch_cost: 0.2449, reader_cost: 0.11758, ips: 16.3347 samples/sec | ETA 00:49:49
2022-07-21 15:35:59 [INFO]	[TRAIN] epoch: 2, iter: 7800/20000, loss: 0.4215, lr: 0.006410, batch_cost: 0.3072, reader_cost: 0.14530, ips: 13.0201 samples/sec | ETA 01:02:28
2022-07-21 15:36:02 [INFO]	[TRAIN] epoch: 2, iter: 7810/20000, loss: 0.4009, lr: 0.006405, batch_cost: 0.3266, reader_cost: 0.16279, ips: 12.2478 samples/sec | ETA 01:06:21
2022-07-21 15:36:05 [INFO]	[TRAIN] epoch: 2, iter: 7820/20000, loss: 0.3907, lr: 0.006400, batch_cost: 0.2655, reader_cost: 0.11928, ips: 15.0649 samples/sec | ETA 00:53:54
2022-07-21 15:36:07 [INFO]	[TRAIN] epoch: 2, iter: 7830/20000, loss: 0.3368, lr: 0.006395, batch_cost: 0.2361, reader_cost: 0.10799, ips: 16.9426 samples/sec | ETA 00:47:53
2022-07-21 15:36:10 [INFO]	[TRAIN] epoch: 2, iter: 7840/20000, loss: 0.4152, lr: 0.006391, batch_cost: 0.2572, reader_cost: 0.12811, ips: 15.5531 samples/sec | ETA 00:52:07
2022-07-21 15:36:12 [INFO]	[TRAIN] epoch: 2, iter: 7850/20000, loss: 0.3609, lr: 0.006386, batch_cost: 0.2520, reader_cost: 0.12739, ips: 15.8737 samples/sec | ETA 00:51:01
2022-07-21 15:36:15 [INFO]	[TRAIN] epoch: 2, iter: 7860/20000, loss: 0.4380, lr: 0.006381, batch_cost: 0.2521, reader_cost: 0.10999, ips: 15.8697 samples/sec | ETA 00:50:59
2022-07-21 15:36:17 [INFO]	[TRAIN] epoch: 2, iter: 7870/20000, loss: 0.3375, lr: 0.006376, batch_cost: 0.2436, reader_cost: 0.11675, ips: 16.4235 samples/sec | ETA 00:49:14
2022-07-21 15:36:20 [INFO]	[TRAIN] epoch: 2, iter: 7880/20000, loss: 0.5077, lr: 0.006372, batch_cost: 0.2351, reader_cost: 0.11235, ips: 17.0105 samples/sec | ETA 00:47:30
2022-07-21 15:36:22 [INFO]	[TRAIN] epoch: 2, iter: 7890/20000, loss: 0.4537, lr: 0.006367, batch_cost: 0.2258, reader_cost: 0.10388, ips: 17.7144 samples/sec | ETA 00:45:34
2022-07-21 15:36:24 [INFO]	[TRAIN] epoch: 2, iter: 7900/20000, loss: 0.4222, lr: 0.006362, batch_cost: 0.2440, reader_cost: 0.11669, ips: 16.3932 samples/sec | ETA 00:49:12
2022-07-21 15:36:27 [INFO]	[TRAIN] epoch: 2, iter: 7910/20000, loss: 0.3258, lr: 0.006358, batch_cost: 0.2438, reader_cost: 0.11675, ips: 16.4040 samples/sec | ETA 00:49:08
2022-07-21 15:36:29 [INFO]	[TRAIN] epoch: 2, iter: 7920/20000, loss: 0.4069, lr: 0.006353, batch_cost: 0.2255, reader_cost: 0.10406, ips: 17.7389 samples/sec | ETA 00:45:23
2022-07-21 15:36:32 [INFO]	[TRAIN] epoch: 2, iter: 7930/20000, loss: 0.3951, lr: 0.006348, batch_cost: 0.2476, reader_cost: 0.12355, ips: 16.1568 samples/sec | ETA 00:49:48
2022-07-21 15:36:34 [INFO]	[TRAIN] epoch: 2, iter: 7940/20000, loss: 0.5528, lr: 0.006343, batch_cost: 0.2230, reader_cost: 0.09922, ips: 17.9347 samples/sec | ETA 00:44:49
2022-07-21 15:36:36 [INFO]	[TRAIN] epoch: 2, iter: 7950/20000, loss: 0.2804, lr: 0.006339, batch_cost: 0.2310, reader_cost: 0.10744, ips: 17.3191 samples/sec | ETA 00:46:23
2022-07-21 15:36:38 [INFO]	[TRAIN] epoch: 2, iter: 7960/20000, loss: 0.3241, lr: 0.006334, batch_cost: 0.2275, reader_cost: 0.10315, ips: 17.5800 samples/sec | ETA 00:45:39
2022-07-21 15:36:41 [INFO]	[TRAIN] epoch: 2, iter: 7970/20000, loss: 0.3166, lr: 0.006329, batch_cost: 0.2419, reader_cost: 0.11946, ips: 16.5373 samples/sec | ETA 00:48:29
2022-07-21 15:36:43 [INFO]	[TRAIN] epoch: 2, iter: 7980/20000, loss: 0.4185, lr: 0.006324, batch_cost: 0.2416, reader_cost: 0.11746, ips: 16.5543 samples/sec | ETA 00:48:24
2022-07-21 15:36:46 [INFO]	[TRAIN] epoch: 2, iter: 7990/20000, loss: 0.2762, lr: 0.006320, batch_cost: 0.2585, reader_cost: 0.13001, ips: 15.4717 samples/sec | ETA 00:51:45
2022-07-21 15:36:48 [INFO]	[TRAIN] epoch: 2, iter: 8000/20000, loss: 0.3836, lr: 0.006315, batch_cost: 0.2449, reader_cost: 0.10571, ips: 16.3356 samples/sec | ETA 00:48:58
2022-07-21 15:36:48 [INFO]	Start evaluating (total_samples: 7361, total_iters: 7361)...
7361/7361 [==============================] - ETA: 0s - batch_cost: 0.0636 - reader cost: 0.00790   - ETA: 7:21 - batch_cost: 0.0608 - ETA: 6:59 - batch_cost: 0.0581 - reader cost: 0.0 - ETA: 6:58 - batch_cost: 0.058 - ETA: 6:46 - batch_cost: 0.0566 - reader cost - ETA: 6:41 - batch_cost: 0.0561 - ETA: 6:35 - batch_cost: 0.0555 - reader cost: - ETA: 6:38 - batch_cost: 0.0560 - reader cost:  - ETA: 6:41 - batch_cost: 0.0566 - reader c - ETA: 6:56 - batch_cost: 0.0586 - reader cost: 0 - ETA: 7:08 - batch_cost: 0.0603 - reader cost: 0 - ETA: 7:21 - batch_cost: 0.0622 - reader cost: - ETA: 7:34 - batch_cost: 0.0641 - reader cost: 0.005 - ETA: 7:36 - batch_cost: 0.0644  - ETA: 7:47 - batch_cost: 0.0661 - reader cost - ETA: 7:47 - batch_cost: 0.0663 - reader cost: - ETA: 7:49 - batch_cost: 0.0665 - reader cost: 0 - ETA: 7:49 - ETA: 7:54 - batch_cost: 0.0678 - reade - ETA: 7:55 - batch_cost: 0.0681 - reader cost: 0.009 - ETA: 7:55 - batch_cost: 0.0681 - reader cost: 0.00 - ETA: 7:55 - batch_cost: 0.0681 - reader cost: 0. - ETA: 7:56 - batch_cost: 0.0683 - reader cost: 0.00 - ETA: 7:57 - batch_cost: 0.0684 - reader cost - ETA: 8:01 - batch_cost: 0.0692 - reader cost: - ETA: 8:05 - batch_ - ETA: 8:03 - batch_cost: 0.0699 - reader cost: 0. - ETA: 8:03 - batch_cost: 0.0698 - reader cost: - ETA: 8:02 - batch_cost: 0.0698 -  - ETA: 8:02 - batch_cost: - ETA: 7:52 - batch_cost: 0.0689 -  - ETA: 7:41 -  - ETA: 7:22 - batch_cost: 0.0657 - reader cost: 0.00 - ETA: 7:22 - batch_cost: 0.0656 - reader co - ETA: 7:18 - batch_cost: 0.0653 - reader cost: 0. - ETA: 7:17 - batch_cost: 0.0651 - reader cost - ETA: - ETA: 7:11 - batch_cost:  - ETA: 7:02 - batch_cost: 0.0641 - reader c - ETA: 6:58 - batch_cost: 0.0637 - reader cost - ETA: 6:56 - batch_cost: 0.0635 - reader cost: 0.008 - ETA: 6:55 - batch_cost: 0.0634 - reader  - ETA: 6:51 - batch_cost: 0.0630 - reader cost - ETA: 6:49 - batch_cost: 0.0628 - reader cost: 0.00 - ETA: 6:48 - batch_cost: 0.0628 - reader cost: 0. - ETA: 6:47 - batch_cost: 0.0627 - reader cost: 0.0 - ETA: 6:47 -  - ETA: 6:37 - batch_cost: 0.0619 - reader cost: 0. - ETA: 6:36 - batch_cost: 0.0618 - reader cost: - ETA: 6:36 - batch_cost: 0.0619 - reader cost: 0. - ETA: 6:36 - ETA: 6:29 - ETA: 6:19 - batch_cost: 0.0607 - reader cost - ETA: 6:17 - batch_cost: 0.0605 - reader cost - ETA: 6:15 - batch_cost: 0.0604 - reader cost: - ETA: 6:14 - batch_cost: 0.0603 - reader cost: 0 - ETA: 6:13 - batch_cost: 0.0602 - reader c - ETA: 6:10 - batch_cost: 0.0600 - reader c - ETA: 6:08 - batch_cost: 0.0598 - reader cost: - ETA: 6:06 -  - ETA: 6:01 - batch_cost: 0.0594 - reader cos - ETA: 6:01 - batch_cost:  - ETA: 6:01 - batch_cost: 0.0599 - reader cost: - ETA: 6:02 - batch_cost: 0.0601 - reader cost - ETA: 6:03 - batch_cost: 0.0604 - reader cost - ETA: 6:05 - batch_cost: 0.0607 - reader cost: 0. - ETA: 6:06 - batch_cost: 0.0609 - reader cost: 0.006 - ETA: 6:06 - batch_cost: 0.0610 - reader cost: 0.0 - ETA: 6:07 - batch_cost: 0.0612 - reader cost: 0 - ETA: 6:08 - batch_cost: 0.0613 - reader cost: 0.00 - ETA: 6:08 - batch_cost: 0.0614 - reader  - ETA: 6:11 - batch_cost: 0.0621 - reader co - ETA: 6:12 - batch_cost: 0.0624 - reader cost: 0.0 - ETA: 6:13 - batch_cost - ETA: 6:13 - batch_c - ETA: 6:14 - batch_cost: 0.0633 - r - ETA: 6:12 - batch_cost: 0.0633 - ETA: 6:06 - batch_cost: 0.0629 - reader co - ETA: 5:53 - batch_cost: 0.0619 - reader cost:  - ETA: 5:52 - batch_cost: 0.0618 - r - ETA: 5:48 - batch_cost: 0 - ETA: 5 - ETA: 5:35 - batch_cost: 0.0606 - reader cost - ETA: 5:33 - batch_cost: 0.0605 - read - ETA: 5:31 - batch_cos - ETA: 5:30 - batch_cost: 0.0605 - reade - ETA: 5:29 - batch_cost: 0.0606 - reader cost: 0. - ETA: 5:29 - batch_cost: 0. - ETA: 5:27 - batch_cost: 0.0607 - reader cost: 0.00 - ETA: 5:27 - batch_cost: 0.0607 - reader cost: 0.0 - ETA: 5:27 - batch_cost: 0.0607 - read - ETA: 5:25 - batch_cost: 0. - ETA: 5:24 - ETA: 5:23 - batch_cost: 0.0612 - reader cost: 0. - ETA: 5:23 - batch_cost: 0.0612 - reader cost: - E - ETA: 5:15 - batch_cost: 0.0614 - reader co - ETA: 5:14 - batch_cost: 0.0614 - reader cost: 0.0 - ETA: 5:14 - batch_cost: 0.0614 - reader - ETA: 5:13 - batch_cost: 0.0615 - reader cost:  - ETA: 5:12 - batch_cost: 0.0615  - ETA: 5:11 - batch_cost: 0.0615 - reader cost: 0.0 - ETA: 5:11 - batch_cost: 0.0616 - reader cost: 0.0 - ETA: 5:11 - batch_cost: 0.0617 - re - ETA: 5:08 - batch_cost: 0.0620 - reader cost: 0.0 - ETA: 5:08 - batch_cost: 0.0621 - reader cost - ETA: 5:08 - batch_cost: 0.0622 - reader cost:  - ETA: 5:08 - batch_cost: 0.0623 - read - ETA: 5:09 - batch_cost: 0.0626 - reader  - ETA: 5:09 - batch_cost: 0.0629 -  - ETA: 5:08 - batch_cost: 0.0630 - reader cost - ETA: 5:08 - batch_cost: 0.0631 - reader cost: 0. - ETA: 5:08 - batch_cost: 0.0632 - reader cost: 0.0 - ETA: - ETA: 5:01 - batch_cost: 0.0634 - reader cost: 0 - ETA: 5:00 - batch_cost: 0.0634 - reader  - ETA: 4:59 - batch_cost: 0.0634 - reader cost:  - ETA: 4:59 - batch_cost: 0.0634 - reader cost: - ETA: 4:59 - batch_cost: 0.0634 - reader cost: 0.00 - ETA: 4:59 - batch_cost: 0.0634 - reader cost: 0.00 - ETA: 4:58 - batch_cost: 0.0634 - reader cost - ETA: 4:58 - batch_cost: 0.0635 - reader cost: 0 - ETA: 4:58 - batch_cost: 0.0 - ETA: 4:51 - batch_cost: 0.0636 - reader cos - ETA: 4:51 - batch_cost: 0.0636 - reader cos - ETA: 4:50 - batch - ETA: 4:47 - batch_cost: 0.0636 - reader - ETA: 4:46 - batch_cost: 0.0636 - reader c - ETA: 4:45 - batch_co - ETA: 4:43 - batch_cost: - ETA: 4:41 - batch_cost: 0.0637 - reader cost:  - ETA: 4:41 - batch_cost: 0.0637 - reader cost: 0.009 - ETA: 4:41 - batch_cost: 0.0638 - reader cost: 0. - ETA: 4:41 - bat - ETA: 4:38 - batch_cost: 0.0639 - reader cost: - ETA: 4:38 - batch_cost: 0.0639 - rea - ETA: 4:36 - batch_cost: 0.0639 - reader cost: 0 - ETA: 4:36 - batch_co -  - ETA: 4:30 - batch_cost: 0.0639 - reader cos - ETA: 4:29 - batch_cost: 0.0639 - reader cost: 0.009 - ETA: 4:29 - batch_c - ETA: 4:26 - batch_cost: 0.0640  - ETA: 4:26 - batch_cost: 0.0642 - reader cost: - ETA: 4:25 - batch_cos - ETA: 4:23 - batch_cost: 0.0642 - reader cost: 0.009 - ETA: 4:23 - batch_cost: 0.0642 - rea - ETA: 4:21 - batch_cost: 0.0642 - reader cost:  - ETA: 4:21 - batch_cost:  - ETA: 4:19 - batch_cost: 0.0642  - ETA: 4:17 - batch_cost: 0.0643 - reader cost: 0.0 - ETA: 4:17 - batch_cost: 0.0643 - reader cost - ETA: 4:17 - batch_cost: 0.0644 - reader cost: 0. - ETA: 4:17 - batch_cost: 0.0644 - read - ETA: 4:17 - batch_cost: 0.0647 - reader cos - ETA: 4:17 - batch_cost: 0.0649 - reader cost:  - ETA: 4:17 - batch_cost: 0.0650 - reader cost: 0.01 - ETA: 4:17 - batch_cos - ETA: 4:15 - batch_cost: 0.0651 - r - ETA: 4:14 - batch_cost: 0.0651 - reader cos - ETA: 4:13 - batch_cost: 0.0651 - reader cost: 0. - ETA: 4:12 - - ETA: 4:09 - batch_cost: 0.0652 - re - ETA: 4:08 - batch_cost: 0.0652 - reader cost: 0.010 - ETA: 4:08 - batch_cost: 0.0652 - reader cost:  - ETA: 4:07 - batch_cost: 0.0652 - read - ETA: 4:06 - batch_cost: 0.0652 -  - ETA: 4:05 - batch_cost: 0.0652 - reader  - ETA: 4:04 - batch_cost: 0.0652 - reader cost: 0. - ETA: 4:04 - batch_co - ETA: 4:01 - batch_cost: 0.0652 - reader cost:  - ETA: 4:01 - batch_cost: 0.0652 - reader cost: 0.0 - ETA: 4:0 - ETA: 3:56 - batch_cost: 0.0651 - reader cost: 0.01 - ETA: 3:56 - batch_cost: 0.0651 - reader - ETA: 3:55 - batch_cost: 0.0651 - reader co - ETA: 3:54 - batch_cost: 0.0651 - reader cost: - ETA: 3:53 - batch_cost: 0.0651 - reader cost: 0.010 - ETA: 3:53 - batch_cost: 0.0651 - reader cost: 0.010 - ETA: 3:53 - batch_cost:  - ETA: 3:50 - batch_cost: 0.0651 - reader cost - ETA: 3:50 - batch_cost: 0 - ETA: 3:47 - batch_cost: 0.0651 - reader cost: 0.010 - ETA: 3:47 - batch_cost: 0.0651 - read - ETA: 3:46 - batch_cost: 0.0651 - reader cost: 0 - ETA: 3:46 - batch_cost: 0.0652 - reader cost - ETA: 3:46 - batch_cost: 0.0652 - reader cost: 0. - ETA: 3:45 - batch_cost: 0.0652 - reader cost: 0 - ETA: 3:45 - batch_cost: 0.0652 - reader cost: 0 - ETA: 3:45 - batch_cost: 0.0652 - reader cost - ETA: 3:44 - batch_cost: 0.0652 - reader cost: 0. - ETA: 3:44 - batch_cost: 0.0652 - reader cost: 0.01 - ETA: 3:44 - batch_cost: 0.0652 -  - ETA: 3:42 - batch_cost: 0.0652 - reader cost: 0 - ETA: 3:41 -  - ETA: 3:38  - ETA: 3:34 - batch_cost: 0.0651 - reader cos - ETA: 3:33 - batch_cost: 0.0652 - reader cost: 0. - ETA: 3:33 - batch_cost: 0.0652 - reader cost: 0.010 - ETA: 3:33 - batch_cost: 0.0652 - rea - ETA: 3:32 - batch_cost: 0.0653 - reader - ETA: 3:31 - batch_cost: 0.0653 - reader cost - ETA: 3:31 - batch_cost: 0.0653 - reader cost: 0. - ETA: 3:30 - batch_cost: 0.06 - ETA: 3:28 - batch_cost: 0.0653 - reader cost: 0 - ETA: 3:28 - batch_ - ETA: 3:25 - batch_cost: 0.0653 - reader cost: 0 - ETA: 3:25 - batch_cost: 0.0 - ETA: 3:23 - batch_cost: 0.0653 - reader cost: 0. - ETA: 3:22 - batch_cost: 0.0653 -  - ETA: 3:21 - batch_cost: 0.0653 - reader cost: 0.01 - ETA: 3:20 - batch_cost: 0.0653 - reader c - ETA: 3:19 - batch_cost: 0.0653 - read - ETA: 3:18 - batch_cost: 0.0653 - reader cost: 0.0 - ETA: 3:18 - batch_cost: 0.0653 - reader cost: 0.01 - ETA: 3:18 - batch_cost: 0.0653 - reader cos - ETA: 3:17 - batch_cost: 0.0653 - reader cost: 0. - ETA: 3:17 - batch_cost: 0.0653 - reader cost: - ETA: 3:17 - batch_cost: 0.0655 - reader cost: 0 - ETA: 3:17 - batch_cost: 0.0656 - reader cost: 0. - ETA: 3:17 - batch_cost: 0.0657 - reader cost: 0 - ETA: 3:17 - batch_cost: 0.0658 - reader cost: 0.01 - ETA: 3:17 - batch_cost: 0.0658 - reader  - ETA: 3:17 - batch_cost: 0.0660 - reader cost - ETA: 3:16 - batch_cost: 0.0661 - reader cost: 0.0 - ETA: 3:16 - batch_cost: 0.0661 - reader cost - ETA: 3:15 - batch_cost: 0.0661 - reader co - ETA: 3:14 - batch_cost: 0.0661 - reader cost: 0.011 - ETA: 3:14 - batch_cost: 0.0661 - reader cost: - ETA: 3:14 - batch_cost: 0.0661 - reader cost: 0. - ETA: 3:13 - batch_cost: 0.0661 - reader cost: 0.011 - ETA: 3:13 - batch_cost: 0.0661 - reader cost: 0 - ETA: 3:13 - batch_cost: 0.0661 - reader cost: 0.0 - ETA: 3:13 - batch_cost: 0.0661 - reader cost: 0. - ETA: 3:12 - batc - ETA: 3:10 - batch_cost: 0.0661 - rea - ETA: 3:08 - batch_cost: 0.0661 - reader cost: 0.01 - ETA: 3:08 - batch_cost: 0.0661 - reader cost: - ETA: 3:08 - batch_cost: 0.0661 - reader c - ETA: 3:07 - batch_cost: 0.0661 - reader cost: - ETA: 3:06 - batch_cost: 0.0661 - reader cost: 0.0 - ETA: 3:06 - batch_cost: 0.0661 - rea - ETA: 3:04 - batch_cost: 0.0661 - reader cost: 0.0 - ETA: 3:04 - batch_cost: 0.0661 - reader cost: 0 - ETA: 3:03 - batch_cost: 0.0661 - reader cost:  - ETA: 3:03 - batch_cost: 0.0660 - reader cost: 0.0 - ETA: 3:03 - batch_ - ETA: 3:00 - batch_cost: 0.0660 - reader cost - ETA: 2:59 - batch_cost: 0.0660 - re - ETA: 2:57 - batch_cost: 0.0660 - reader cost: 0.01 - ETA: 2:57 - batch_cost: 0.0660 - re - ETA: 2:56 - batch_cost: - ETA: 2:53 - batch_cost: 0. - ETA: 2:51 - batch_cost: 0.0660 - reader  - ETA: 2:50 - batch_cost: 0.0660 - reader cost: 0.0 - ETA: 2:50 - batch_cost: 0.0 - ETA: 2:48 - batch_cost: 0.0660 - reader cost: 0. - ETA: 2:48 - batch_cost: 0.0660 - reader cost: 0.0 - ETA: 2:48 - batch_cost: 0.0661 - reader cost: 0.0 - ETA: 2:47 - batch_cost: 0.0661 - reader cost: 0. - ETA: 2:47 - batch_cost: 0.0661 - reader cost: 0.01 - ETA: 2:47 - batch_cost - ETA: 2:45 - batch_cost: 0.0661 - reader cost:  - ETA: 2:44 - batch_cost: 0.0661 - r - ETA: 2:43 - batch_cost: 0.0661 - reader  - ETA: 2:42 - batch_cost: 0.0661 - reade - ETA: 2:41 - batch_cost: 0.0661 - reader - ETA: 2:40 - batch_cost:  - ETA: 2:38 - batch_cost: 0.0662 - reader cost: 0.01 - ETA: 2:38 - batch_cost: 0.0662 - reader cost: 0 - ETA: 2:37 - batch_cost: 0.0662 - reader cost: 0.01 - ETA: 2:37 - batch_cost: 0.0662 - - ETA: 2:36 - batch_cost: 0.0663 - reader cost: 0. - ETA: 2:36 - batch_cost: 0.0663  - ETA: 2:34 - batch_cost: 0 - ETA: 2:31 - batch_cost: 0. - ETA: - ETA: 2:22 - batch_cost: 0.0660 - reader cost: - ETA: 2:21 - batch_cost: 0.0659 - reader - ETA: 2 - ETA: 2:13 - batch_cost: 0.0657 - reader cost: 0.01 - ETA: 2:13 - batch_cost: 0.0657 - reade - ETA: 2:12 - batch_cost: 0.0658 - reader cost: 0.01 - ETA: 2:12 - batch_cost: 0.0658 - reader cost: 0.01 - ETA: 2:12 - batch_cost: 0.0658 - reader cost: 0 - ETA: 2:11 - batch_cost: 0.0658 - reader cost: 0.0 - ETA: 2:11 - batch_cost: 0.0658 - reader cost: 0.0 - ETA: 2:11 - batch_cost: 0.0658 - reader cos - ETA: 2:11 - batch_cost: 0.0659 - reader cost: 0. - ETA: 2:10 - batch_cost: 0.0659 - reader cost: 0.0 - ETA: 2:10 - batch_cost: 0.0659 - reader cost: 0.0 - ETA: 2:10 - batch_cost: 0.0659 - reader cost:  - ETA: 2:10 - batch_cost: 0.0660 - reader cost: - ETA: 2:09 - batch_cost: 0.0660 - reader cost: 0.010 - ETA: 2:09 - batch_cost: 0.0660 - reader cost - ETA: 2:09 - batch_cost: 0.0661 - reader cost: 0.01 - ETA: 2:09 - batch_cost: 0.0661 - reader cos - ETA: 2:08 - batch_cost: 0.0661 - reade - ETA: 2:06 - batch_cost: 0.0660 - reader cost:  - ETA: 2:05 - batch_cost: 0.0660 - reader cost: 0 - ETA: 2:05 - batch_cost: 0.0660 - reader cost: - ETA: 2:04 - batch_cost: 0.0660 - reader cost - ETA: 2:03 - batch_cost: - ETA: 1:59 - batch_cost: 0.0658 - - ETA: 1:57 - batch_cost: 0.0657 - reader cos - ETA: 1:55 - batch_cost: 0.0657 - reader cost:  - ET - ETA: 1:48 - batch_cost: 0.0655 - reader cost:  - ETA: 1:48 - batch_cost: 0.0654 - reader cost: 0.010 - ETA: 1:47 - batch_cost: 0.0654 - reader - ETA: 1:39 - batch_cost: 0.0652 - reader - ETA: 1:38 - batch_cost: 0.0651 - reader cost: 0.0 - ETA: 1:37 - batch_cost: 0.0651 - reader cost: 0.0 - ETA: 1:37 - batch_cost: 0.0651 - reader cost: 0.00 - ETA: 1:37 - batch_cost: 0.0652 - reader cost: 0 - ETA: 1:36 - batch_cost: 0.0652 - reader co - ETA: 1:36 - batch_cost: 0.06 - ETA: 1:33 - batch_cost: 0.0651 - read - ETA: 1:31 - batch_cost: 0.0651 - reader cost: 0.00 - ETA: 1:31 - batch_cost: 0.0 - ETA: 1:28 - batch_cost: 0.0650 - r - ETA: 1:26 - batch_cost: 0.0650 - reader cost: 0.0 - ETA: 1:26 - batch_cost: 0.0650 - reader  - ETA: 1:25 - batch_cost: 0.0650 - reader cost:  - ETA: 1:24 - batch_cost: 0.0650 - reader cos - ETA: 1:23 - batch_cost: 0.0650 - reader cost:  - ETA: 1:22 - batch_cost: 0 - ETA: 1:19 - batch_cost: 0.0648 - reader cost: 0.009 - ETA: 1:19 - batch_cost: 0.0648 - reader cos - ETA: 1:18 - batch_cost: 0.0648 - reader  - ETA: 1:16 - - ETA: 1:11 - batch_cost: 0.0646 - reader cost: 0.009 - ETA: 1:11 - batch_ - ETA: 1:07 - batch_cost: 0.0645 - reader cos - ETA: 1:06 - batch_cost: 0.0645 - read - ETA: 1:04 - batch_cost: 0.0644 - reader cost: 0.00 - ETA: 1:04 - batch_cost: 0.0644 - reader cost: 0.0 - ETA: 1:03 - batch_cost: 0.0644 - reader cost: 0.009 - ETA: 1:03 - batch_cost: 0.0644 - reader cost: 0 - ETA: 1:02 - batch_cost: 0.0644 - reader cost: 0.009 - ETA: 1:02 - batch_cost: 0.0644 - reade - ETA: 1:01 - batch_cost: 0.0643 - reader cost: 0.009 - ETA: 1:00 - batch_cost: 0.0 - ETA: 59s - batch_cost: 0.0643 - reader  - ETA: 58s - batch_cost: 0.0643 - reader co - ETA: 58s - batch_cost: 0.0644 - reade - ETA: 57s - batch_cost: 0.06 - ETA: 56s - batch_cost: 0.0643 - reader  - ETA: 55s - batch_cost: 0.0643 - r - ETA: 52s - batch_cost: 0.0644 - reader cost - ETA: 52s - batch_cost: 0.0644 - reader co - ETA: 52s - batch_cost: 0.0644 - reader cost: 0. - ETA: 51s - batch_cost: 0.0644 - reader co - ETA: 51s - batch_cost: 0.0645 - reader cost:  - ETA: 51s - batch_cost: 0.0645 - r - ETA: 50s - batch_cost: 0.0645 - reader cost: 0. - ETA: 50s - batch_cost: 0.0645 - reader co - ETA: 50s - batch_cost: 0.0646 - reader cost:  - ETA: 50s - batch_cost: 0.0646 - reader cost: 0.00 - ETA: 50s - batch_cost: 0.0646 - reader cost: 0. - ETA: 49s - batch_cost: 0.0646 - reader cost: 0. - ETA: 49s - batch_cost: 0.0646 - reader cost: 0.00 - ETA: 49s - batch_cost: 0.0646 - reader cost: 0. - ETA: 49s - batch_cost: 0.06 - ETA: 48s - batch_cost: 0.0646 - reader co - ETA: 47s - batch_cost: 0.0646 - reader cost: 0.00 - ETA: 47s - batch_cost: 0.0646 - r - ETA: 46s - batch_cost: 0.0645 - rea - ETA: 45s - batch_cost: 0.0645 - reader cost:  - ETA: 45s - batch_cost: 0.0645 - reader cost: 0.00 - ETA: 45s - batch_cost - ETA: 43s - batch_cost: 0.0645 - rea - ETA:  - ETA: 39s - batch_cost: 0.0644 - reader co - ETA: 38s - batch_cost: 0.0643 - rea - ETA: 37 - - ETA: 32s - batch_cost: 0.0642 - reader cost:  - ETA: 31s - batch_cost: 0.0642 - reader cost: 0. - ETA: 31s - batch_cost: 0.0642 - reader cost: 0. - ETA: 31s -  - ETA: 28s - batch_cost: 0.0641 - reader cost:  - ETA: 25s - batch_cost: 0.0640 - reader cost:  - ETA: 25s - batch_cost: 0.0640 - reader cost: 0. - ETA: 25s - batch_cost:  - ETA: 17s - batch_cost: 0.0639 - reader cost - ETA: 17s - batch_cost: 0.0639 - reade - ETA: 16s - batch_cost: 0.0640 - reader co - ETA: 16s - batch_cost: 0.0640 - reader cost: 0.00 - ETA: 16s - batch_cost: 0.0640 - reader cost - ETA: 15s - batch_cost: 0.0640 - reader cost:  - ETA: 15s - batch_cost: 0.0640 - reader cost - ETA: 15s - batch_cost: 0.0640 - reader cost - ETA: 14s - batch_cost - ETA: 13s - batch_cost:  - ETA: 11s - batch_cost: 0.0639 - reader cost: 0. - ETA: 11s - batch_cost: 0.06 - ETA: 10s - batch_cost: 0.0639 - reader cost: 0. - ETA: 9s - batch_cost: 0.0639 - reader  - ETA: 8s - batch_cost: 0.0638 - reader cost - ETA: 7s - batch_cost: 0.0638 - reader cost: 0.0 - ETA: 7s - batch_cost: 0.0638 - reader cost: 0 - ETA: 6s - batch_cost: 0.0638 -  - ETA: 4s - batch_cost: 0.0637 - reader cost: 0.00 - ETA: 3s - batch_cost: 0.0637 - reader cost: 0.007 - ETA: 3s - batch_cost: 0.0637 - reader cost: 0.007 - ETA: 3s - batch_cost: 0.0637 - reader cost:  - ETA: 2s - batch_cost: 0.0637 - reader cost: 0. - ETA: 2s - batch_cost: 0.0637 - - 471s 64ms/step - batch_cost: 0.0636 - reader cost: 0.0079
2022-07-21 15:44:39 [INFO]	[EVAL] #Images: 7361 mIoU: 0.1066 Acc: 0.9839 Kappa: 0.4030 Dice: 0.1351
2022-07-21 15:44:39 [INFO]	[EVAL] Class IoU: 
[0.9857 0.1112 0.0521 0.1735 0.0001 0.     0.     0.     0.2299 0.0064
 0.     0.     0.     0.     0.     0.     0.     0.     0.     0.5722]
2022-07-21 15:44:39 [INFO]	[EVAL] Class Precision: 
[0.9877 0.3447 0.3259 0.5863 0.1206 0.     0.     0.     0.555  0.1666
 0.     0.     0.     0.0962 0.     0.     0.     0.     0.     0.6706]
2022-07-21 15:44:39 [INFO]	[EVAL] Class Recall: 
[0.998  0.141  0.0584 0.1978 0.0001 0.     0.     0.     0.2819 0.0066
 0.     0.     0.     0.     0.     0.     0.     0.     0.     0.796 ]
2022-07-21 15:44:39 [INFO]	[EVAL] The model with the best validation mIoU (0.1066) was saved at iter 8000.
2022-07-21 15:44:40 [INFO]	[TRAIN] epoch: 2, iter: 8010/20000, loss: 0.2766, lr: 0.006310, batch_cost: 0.1433, reader_cost: 0.01768, ips: 27.9072 samples/sec | ETA 00:28:38
2022-07-21 15:44:43 [INFO]	[TRAIN] epoch: 2, iter: 8020/20000, loss: 0.3989, lr: 0.006305, batch_cost: 0.2489, reader_cost: 0.11926, ips: 16.0693 samples/sec | ETA 00:49:42
2022-07-21 15:44:45 [INFO]	[TRAIN] epoch: 2, iter: 8030/20000, loss: 0.4043, lr: 0.006301, batch_cost: 0.2428, reader_cost: 0.11646, ips: 16.4777 samples/sec | ETA 00:48:25
2022-07-21 15:44:48 [INFO]	[TRAIN] epoch: 2, iter: 8040/20000, loss: 0.3067, lr: 0.006296, batch_cost: 0.2321, reader_cost: 0.10884, ips: 17.2346 samples/sec | ETA 00:46:15
2022-07-21 15:44:51 [INFO]	[TRAIN] epoch: 2, iter: 8050/20000, loss: 0.4878, lr: 0.006291, batch_cost: 0.2927, reader_cost: 0.14845, ips: 13.6651 samples/sec | ETA 00:58:17
2022-07-21 15:44:53 [INFO]	[TRAIN] epoch: 2, iter: 8060/20000, loss: 0.3029, lr: 0.006287, batch_cost: 0.2351, reader_cost: 0.11031, ips: 17.0146 samples/sec | ETA 00:46:47
2022-07-21 15:44:55 [INFO]	[TRAIN] epoch: 2, iter: 8070/20000, loss: 0.3445, lr: 0.006282, batch_cost: 0.2349, reader_cost: 0.10511, ips: 17.0270 samples/sec | ETA 00:46:42
2022-07-21 15:44:58 [INFO]	[TRAIN] epoch: 2, iter: 8080/20000, loss: 0.3386, lr: 0.006277, batch_cost: 0.2336, reader_cost: 0.10242, ips: 17.1242 samples/sec | ETA 00:46:24
2022-07-21 15:45:00 [INFO]	[TRAIN] epoch: 2, iter: 8090/20000, loss: 0.3505, lr: 0.006272, batch_cost: 0.2749, reader_cost: 0.11246, ips: 14.5507 samples/sec | ETA 00:54:34
2022-07-21 15:45:04 [INFO]	[TRAIN] epoch: 2, iter: 8100/20000, loss: 0.3735, lr: 0.006268, batch_cost: 0.3426, reader_cost: 0.17639, ips: 11.6752 samples/sec | ETA 01:07:57
2022-07-21 15:45:06 [INFO]	[TRAIN] epoch: 2, iter: 8110/20000, loss: 0.3946, lr: 0.006263, batch_cost: 0.2326, reader_cost: 0.10904, ips: 17.1976 samples/sec | ETA 00:46:05
2022-07-21 15:45:08 [INFO]	[TRAIN] epoch: 2, iter: 8120/20000, loss: 0.3739, lr: 0.006258, batch_cost: 0.2217, reader_cost: 0.09773, ips: 18.0460 samples/sec | ETA 00:43:53
2022-07-21 15:45:11 [INFO]	[TRAIN] epoch: 2, iter: 8130/20000, loss: 0.5236, lr: 0.006253, batch_cost: 0.2237, reader_cost: 0.09852, ips: 17.8823 samples/sec | ETA 00:44:15
2022-07-21 15:45:13 [INFO]	[TRAIN] epoch: 2, iter: 8140/20000, loss: 0.4171, lr: 0.006249, batch_cost: 0.2346, reader_cost: 0.10903, ips: 17.0482 samples/sec | ETA 00:46:22
2022-07-21 15:45:15 [INFO]	[TRAIN] epoch: 2, iter: 8150/20000, loss: 0.4124, lr: 0.006244, batch_cost: 0.2369, reader_cost: 0.10989, ips: 16.8857 samples/sec | ETA 00:46:47
2022-07-21 15:45:18 [INFO]	[TRAIN] epoch: 2, iter: 8160/20000, loss: 0.4365, lr: 0.006239, batch_cost: 0.2284, reader_cost: 0.10616, ips: 17.5131 samples/sec | ETA 00:45:04
2022-07-21 15:45:20 [INFO]	[TRAIN] epoch: 2, iter: 8170/20000, loss: 0.4971, lr: 0.006234, batch_cost: 0.2287, reader_cost: 0.10474, ips: 17.4938 samples/sec | ETA 00:45:04
2022-07-21 15:45:23 [INFO]	[TRAIN] epoch: 2, iter: 8180/20000, loss: 0.4178, lr: 0.006230, batch_cost: 0.2756, reader_cost: 0.13271, ips: 14.5136 samples/sec | ETA 00:54:17
2022-07-21 15:45:25 [INFO]	[TRAIN] epoch: 2, iter: 8190/20000, loss: 0.3622, lr: 0.006225, batch_cost: 0.2292, reader_cost: 0.10659, ips: 17.4483 samples/sec | ETA 00:45:07
2022-07-21 15:45:28 [INFO]	[TRAIN] epoch: 2, iter: 8200/20000, loss: 0.3920, lr: 0.006220, batch_cost: 0.2753, reader_cost: 0.13128, ips: 14.5310 samples/sec | ETA 00:54:08
2022-07-21 15:45:30 [INFO]	[TRAIN] epoch: 2, iter: 8210/20000, loss: 0.4872, lr: 0.006215, batch_cost: 0.2456, reader_cost: 0.11249, ips: 16.2845 samples/sec | ETA 00:48:16
2022-07-21 15:45:32 [INFO]	[TRAIN] epoch: 2, iter: 8220/20000, loss: 0.2869, lr: 0.006211, batch_cost: 0.2313, reader_cost: 0.10790, ips: 17.2972 samples/sec | ETA 00:45:24
2022-07-21 15:45:35 [INFO]	[TRAIN] epoch: 2, iter: 8230/20000, loss: 0.4220, lr: 0.006206, batch_cost: 0.2341, reader_cost: 0.11079, ips: 17.0831 samples/sec | ETA 00:45:55
2022-07-21 15:45:37 [INFO]	[TRAIN] epoch: 2, iter: 8240/20000, loss: 0.2934, lr: 0.006201, batch_cost: 0.2256, reader_cost: 0.10137, ips: 17.7332 samples/sec | ETA 00:44:12
2022-07-21 15:45:40 [INFO]	[TRAIN] epoch: 2, iter: 8250/20000, loss: 0.3758, lr: 0.006196, batch_cost: 0.2592, reader_cost: 0.13628, ips: 15.4301 samples/sec | ETA 00:50:45
2022-07-21 15:45:42 [INFO]	[TRAIN] epoch: 2, iter: 8260/20000, loss: 0.3226, lr: 0.006192, batch_cost: 0.2549, reader_cost: 0.12604, ips: 15.6934 samples/sec | ETA 00:49:52
2022-07-21 15:45:44 [INFO]	[TRAIN] epoch: 2, iter: 8270/20000, loss: 0.4110, lr: 0.006187, batch_cost: 0.2241, reader_cost: 0.10053, ips: 17.8465 samples/sec | ETA 00:43:49
2022-07-21 15:45:47 [INFO]	[TRAIN] epoch: 2, iter: 8280/20000, loss: 0.3900, lr: 0.006182, batch_cost: 0.2316, reader_cost: 0.10508, ips: 17.2675 samples/sec | ETA 00:45:14
2022-07-21 15:45:49 [INFO]	[TRAIN] epoch: 2, iter: 8290/20000, loss: 0.4330, lr: 0.006177, batch_cost: 0.2290, reader_cost: 0.10649, ips: 17.4640 samples/sec | ETA 00:44:42
2022-07-21 15:45:51 [INFO]	[TRAIN] epoch: 2, iter: 8300/20000, loss: 0.4551, lr: 0.006173, batch_cost: 0.2257, reader_cost: 0.09937, ips: 17.7202 samples/sec | ETA 00:44:01
2022-07-21 15:45:54 [INFO]	[TRAIN] epoch: 2, iter: 8310/20000, loss: 0.3461, lr: 0.006168, batch_cost: 0.2793, reader_cost: 0.13615, ips: 14.3230 samples/sec | ETA 00:54:24
2022-07-21 15:45:57 [INFO]	[TRAIN] epoch: 2, iter: 8320/20000, loss: 0.2819, lr: 0.006163, batch_cost: 0.2380, reader_cost: 0.11318, ips: 16.8054 samples/sec | ETA 00:46:20
2022-07-21 15:45:59 [INFO]	[TRAIN] epoch: 2, iter: 8330/20000, loss: 0.4229, lr: 0.006158, batch_cost: 0.2389, reader_cost: 0.11195, ips: 16.7407 samples/sec | ETA 00:46:28
2022-07-21 15:46:01 [INFO]	[TRAIN] epoch: 2, iter: 8340/20000, loss: 0.2991, lr: 0.006154, batch_cost: 0.2471, reader_cost: 0.11625, ips: 16.1850 samples/sec | ETA 00:48:01
2022-07-21 15:46:04 [INFO]	[TRAIN] epoch: 2, iter: 8350/20000, loss: 0.4036, lr: 0.006149, batch_cost: 0.2535, reader_cost: 0.11695, ips: 15.7782 samples/sec | ETA 00:49:13
2022-07-21 15:46:07 [INFO]	[TRAIN] epoch: 2, iter: 8360/20000, loss: 0.3338, lr: 0.006144, batch_cost: 0.2793, reader_cost: 0.13255, ips: 14.3213 samples/sec | ETA 00:54:11
2022-07-21 15:46:10 [INFO]	[TRAIN] epoch: 2, iter: 8370/20000, loss: 0.4172, lr: 0.006139, batch_cost: 0.3689, reader_cost: 0.17745, ips: 10.8426 samples/sec | ETA 01:11:30
2022-07-21 15:46:13 [INFO]	[TRAIN] epoch: 2, iter: 8380/20000, loss: 0.3168, lr: 0.006135, batch_cost: 0.2472, reader_cost: 0.11993, ips: 16.1806 samples/sec | ETA 00:47:52
2022-07-21 15:46:15 [INFO]	[TRAIN] epoch: 2, iter: 8390/20000, loss: 0.4087, lr: 0.006130, batch_cost: 0.2320, reader_cost: 0.10576, ips: 17.2399 samples/sec | ETA 00:44:53
2022-07-21 15:46:17 [INFO]	[TRAIN] epoch: 2, iter: 8400/20000, loss: 0.3769, lr: 0.006125, batch_cost: 0.2148, reader_cost: 0.09196, ips: 18.6231 samples/sec | ETA 00:41:31
2022-07-21 15:46:20 [INFO]	[TRAIN] epoch: 2, iter: 8410/20000, loss: 0.3434, lr: 0.006120, batch_cost: 0.2500, reader_cost: 0.12346, ips: 15.9972 samples/sec | ETA 00:48:18
2022-07-21 15:46:22 [INFO]	[TRAIN] epoch: 2, iter: 8420/20000, loss: 0.3542, lr: 0.006116, batch_cost: 0.2431, reader_cost: 0.11838, ips: 16.4565 samples/sec | ETA 00:46:54
2022-07-21 15:46:25 [INFO]	[TRAIN] epoch: 2, iter: 8430/20000, loss: 0.3150, lr: 0.006111, batch_cost: 0.2751, reader_cost: 0.12519, ips: 14.5411 samples/sec | ETA 00:53:02
2022-07-21 15:46:27 [INFO]	[TRAIN] epoch: 2, iter: 8440/20000, loss: 0.2827, lr: 0.006106, batch_cost: 0.2379, reader_cost: 0.11540, ips: 16.8156 samples/sec | ETA 00:45:49
2022-07-21 15:46:30 [INFO]	[TRAIN] epoch: 2, iter: 8450/20000, loss: 0.3541, lr: 0.006101, batch_cost: 0.2704, reader_cost: 0.11784, ips: 14.7916 samples/sec | ETA 00:52:03
2022-07-21 15:46:32 [INFO]	[TRAIN] epoch: 2, iter: 8460/20000, loss: 0.3654, lr: 0.006097, batch_cost: 0.2334, reader_cost: 0.11022, ips: 17.1365 samples/sec | ETA 00:44:53
2022-07-21 15:46:35 [INFO]	[TRAIN] epoch: 2, iter: 8470/20000, loss: 0.4482, lr: 0.006092, batch_cost: 0.2440, reader_cost: 0.11948, ips: 16.3913 samples/sec | ETA 00:46:53
2022-07-21 15:46:37 [INFO]	[TRAIN] epoch: 2, iter: 8480/20000, loss: 0.3939, lr: 0.006087, batch_cost: 0.2332, reader_cost: 0.10974, ips: 17.1535 samples/sec | ETA 00:44:46
2022-07-21 15:46:40 [INFO]	[TRAIN] epoch: 2, iter: 8490/20000, loss: 0.3876, lr: 0.006082, batch_cost: 0.2654, reader_cost: 0.14426, ips: 15.0734 samples/sec | ETA 00:50:54
2022-07-21 15:46:42 [INFO]	[TRAIN] epoch: 2, iter: 8500/20000, loss: 0.3985, lr: 0.006078, batch_cost: 0.2517, reader_cost: 0.12342, ips: 15.8934 samples/sec | ETA 00:48:14
2022-07-21 15:46:45 [INFO]	[TRAIN] epoch: 2, iter: 8510/20000, loss: 0.3719, lr: 0.006073, batch_cost: 0.2289, reader_cost: 0.10306, ips: 17.4771 samples/sec | ETA 00:43:49
2022-07-21 15:46:47 [INFO]	[TRAIN] epoch: 2, iter: 8520/20000, loss: 0.2961, lr: 0.006068, batch_cost: 0.2316, reader_cost: 0.10777, ips: 17.2686 samples/sec | ETA 00:44:19
2022-07-21 15:46:50 [INFO]	[TRAIN] epoch: 2, iter: 8530/20000, loss: 0.2700, lr: 0.006063, batch_cost: 0.2627, reader_cost: 0.13880, ips: 15.2280 samples/sec | ETA 00:50:12
2022-07-21 15:46:52 [INFO]	[TRAIN] epoch: 2, iter: 8540/20000, loss: 0.4039, lr: 0.006059, batch_cost: 0.2444, reader_cost: 0.12187, ips: 16.3671 samples/sec | ETA 00:46:40
2022-07-21 15:46:54 [INFO]	[TRAIN] epoch: 2, iter: 8550/20000, loss: 0.3368, lr: 0.006054, batch_cost: 0.2404, reader_cost: 0.11667, ips: 16.6406 samples/sec | ETA 00:45:52
2022-07-21 15:46:57 [INFO]	[TRAIN] epoch: 2, iter: 8560/20000, loss: 0.3673, lr: 0.006049, batch_cost: 0.2821, reader_cost: 0.13864, ips: 14.1808 samples/sec | ETA 00:53:46
2022-07-21 15:47:00 [INFO]	[TRAIN] epoch: 2, iter: 8570/20000, loss: 0.4302, lr: 0.006044, batch_cost: 0.2413, reader_cost: 0.11875, ips: 16.5785 samples/sec | ETA 00:45:57
2022-07-21 15:47:02 [INFO]	[TRAIN] epoch: 2, iter: 8580/20000, loss: 0.2675, lr: 0.006040, batch_cost: 0.2393, reader_cost: 0.11386, ips: 16.7170 samples/sec | ETA 00:45:32
2022-07-21 15:47:04 [INFO]	[TRAIN] epoch: 2, iter: 8590/20000, loss: 0.4063, lr: 0.006035, batch_cost: 0.2385, reader_cost: 0.11242, ips: 16.7690 samples/sec | ETA 00:45:21
2022-07-21 15:47:07 [INFO]	[TRAIN] epoch: 2, iter: 8600/20000, loss: 0.3163, lr: 0.006030, batch_cost: 0.2519, reader_cost: 0.11682, ips: 15.8821 samples/sec | ETA 00:47:51
2022-07-21 15:47:09 [INFO]	[TRAIN] epoch: 2, iter: 8610/20000, loss: 0.3200, lr: 0.006025, batch_cost: 0.2234, reader_cost: 0.08122, ips: 17.9014 samples/sec | ETA 00:42:25
2022-07-21 15:47:12 [INFO]	[TRAIN] epoch: 2, iter: 8620/20000, loss: 0.3741, lr: 0.006021, batch_cost: 0.2400, reader_cost: 0.10857, ips: 16.6696 samples/sec | ETA 00:45:30
2022-07-21 15:47:15 [INFO]	[TRAIN] epoch: 2, iter: 8630/20000, loss: 0.3756, lr: 0.006016, batch_cost: 0.3348, reader_cost: 0.16522, ips: 11.9457 samples/sec | ETA 01:03:27
2022-07-21 15:47:18 [INFO]	[TRAIN] epoch: 2, iter: 8640/20000, loss: 0.5047, lr: 0.006011, batch_cost: 0.3262, reader_cost: 0.14204, ips: 12.2641 samples/sec | ETA 01:01:45
2022-07-21 15:47:21 [INFO]	[TRAIN] epoch: 2, iter: 8650/20000, loss: 0.2849, lr: 0.006006, batch_cost: 0.2326, reader_cost: 0.10460, ips: 17.1996 samples/sec | ETA 00:43:59
2022-07-21 15:47:23 [INFO]	[TRAIN] epoch: 2, iter: 8660/20000, loss: 0.5139, lr: 0.006001, batch_cost: 0.2351, reader_cost: 0.10916, ips: 17.0174 samples/sec | ETA 00:44:25
2022-07-21 15:47:26 [INFO]	[TRAIN] epoch: 2, iter: 8670/20000, loss: 0.3818, lr: 0.005997, batch_cost: 0.2662, reader_cost: 0.13807, ips: 15.0279 samples/sec | ETA 00:50:15
2022-07-21 15:47:28 [INFO]	[TRAIN] epoch: 2, iter: 8680/20000, loss: 0.3030, lr: 0.005992, batch_cost: 0.2644, reader_cost: 0.12243, ips: 15.1304 samples/sec | ETA 00:49:52
2022-07-21 15:47:31 [INFO]	[TRAIN] epoch: 2, iter: 8690/20000, loss: 0.2631, lr: 0.005987, batch_cost: 0.2462, reader_cost: 0.11931, ips: 16.2447 samples/sec | ETA 00:46:24
2022-07-21 15:47:34 [INFO]	[TRAIN] epoch: 2, iter: 8700/20000, loss: 0.3303, lr: 0.005982, batch_cost: 0.2877, reader_cost: 0.13708, ips: 13.9044 samples/sec | ETA 00:54:10
2022-07-21 15:47:36 [INFO]	[TRAIN] epoch: 2, iter: 8710/20000, loss: 0.4510, lr: 0.005978, batch_cost: 0.2357, reader_cost: 0.11370, ips: 16.9683 samples/sec | ETA 00:44:21
2022-07-21 15:47:38 [INFO]	[TRAIN] epoch: 2, iter: 8720/20000, loss: 0.2980, lr: 0.005973, batch_cost: 0.2384, reader_cost: 0.11671, ips: 16.7756 samples/sec | ETA 00:44:49
2022-07-21 15:47:41 [INFO]	[TRAIN] epoch: 2, iter: 8730/20000, loss: 0.4752, lr: 0.005968, batch_cost: 0.2336, reader_cost: 0.11292, ips: 17.1205 samples/sec | ETA 00:43:53
2022-07-21 15:47:43 [INFO]	[TRAIN] epoch: 2, iter: 8740/20000, loss: 0.2895, lr: 0.005963, batch_cost: 0.2120, reader_cost: 0.08844, ips: 18.8679 samples/sec | ETA 00:39:47
2022-07-21 15:47:45 [INFO]	[TRAIN] epoch: 2, iter: 8750/20000, loss: 0.5014, lr: 0.005959, batch_cost: 0.2244, reader_cost: 0.10152, ips: 17.8225 samples/sec | ETA 00:42:04
2022-07-21 15:47:48 [INFO]	[TRAIN] epoch: 2, iter: 8760/20000, loss: 0.4452, lr: 0.005954, batch_cost: 0.2496, reader_cost: 0.11774, ips: 16.0262 samples/sec | ETA 00:46:45
2022-07-21 15:47:50 [INFO]	[TRAIN] epoch: 2, iter: 8770/20000, loss: 0.4412, lr: 0.005949, batch_cost: 0.2501, reader_cost: 0.12233, ips: 15.9915 samples/sec | ETA 00:46:48
2022-07-21 15:47:52 [INFO]	[TRAIN] epoch: 2, iter: 8780/20000, loss: 0.4256, lr: 0.005944, batch_cost: 0.2300, reader_cost: 0.10605, ips: 17.3880 samples/sec | ETA 00:43:01
2022-07-21 15:47:55 [INFO]	[TRAIN] epoch: 2, iter: 8790/20000, loss: 0.3645, lr: 0.005940, batch_cost: 0.2510, reader_cost: 0.12926, ips: 15.9356 samples/sec | ETA 00:46:53
2022-07-21 15:47:57 [INFO]	[TRAIN] epoch: 2, iter: 8800/20000, loss: 0.3587, lr: 0.005935, batch_cost: 0.2469, reader_cost: 0.11739, ips: 16.2007 samples/sec | ETA 00:46:05
2022-07-21 15:48:00 [INFO]	[TRAIN] epoch: 2, iter: 8810/20000, loss: 0.4339, lr: 0.005930, batch_cost: 0.2665, reader_cost: 0.11200, ips: 15.0073 samples/sec | ETA 00:49:42
2022-07-21 15:48:02 [INFO]	[TRAIN] epoch: 2, iter: 8820/20000, loss: 0.3538, lr: 0.005925, batch_cost: 0.2394, reader_cost: 0.11325, ips: 16.7088 samples/sec | ETA 00:44:36
2022-07-21 15:48:05 [INFO]	[TRAIN] epoch: 2, iter: 8830/20000, loss: 0.3803, lr: 0.005920, batch_cost: 0.2275, reader_cost: 0.10072, ips: 17.5838 samples/sec | ETA 00:42:20
2022-07-21 15:48:07 [INFO]	[TRAIN] epoch: 2, iter: 8840/20000, loss: 0.4038, lr: 0.005916, batch_cost: 0.2326, reader_cost: 0.10755, ips: 17.1938 samples/sec | ETA 00:43:16
2022-07-21 15:48:09 [INFO]	[TRAIN] epoch: 2, iter: 8850/20000, loss: 0.4135, lr: 0.005911, batch_cost: 0.2374, reader_cost: 0.11548, ips: 16.8507 samples/sec | ETA 00:44:06
2022-07-21 15:48:12 [INFO]	[TRAIN] epoch: 2, iter: 8860/20000, loss: 0.3957, lr: 0.005906, batch_cost: 0.2330, reader_cost: 0.10611, ips: 17.1695 samples/sec | ETA 00:43:15
2022-07-21 15:48:14 [INFO]	[TRAIN] epoch: 2, iter: 8870/20000, loss: 0.4652, lr: 0.005901, batch_cost: 0.2313, reader_cost: 0.10478, ips: 17.2972 samples/sec | ETA 00:42:53
2022-07-21 15:48:16 [INFO]	[TRAIN] epoch: 2, iter: 8880/20000, loss: 0.3779, lr: 0.005897, batch_cost: 0.2346, reader_cost: 0.10679, ips: 17.0474 samples/sec | ETA 00:43:29
2022-07-21 15:48:19 [INFO]	[TRAIN] epoch: 2, iter: 8890/20000, loss: 0.3451, lr: 0.005892, batch_cost: 0.2330, reader_cost: 0.10531, ips: 17.1666 samples/sec | ETA 00:43:08
2022-07-21 15:48:21 [INFO]	[TRAIN] epoch: 2, iter: 8900/20000, loss: 0.3680, lr: 0.005887, batch_cost: 0.2422, reader_cost: 0.11150, ips: 16.5143 samples/sec | ETA 00:44:48
2022-07-21 15:48:25 [INFO]	[TRAIN] epoch: 2, iter: 8910/20000, loss: 0.2894, lr: 0.005882, batch_cost: 0.3805, reader_cost: 0.19572, ips: 10.5125 samples/sec | ETA 01:10:19
2022-07-21 15:48:27 [INFO]	[TRAIN] epoch: 2, iter: 8920/20000, loss: 0.3174, lr: 0.005878, batch_cost: 0.2441, reader_cost: 0.11382, ips: 16.3896 samples/sec | ETA 00:45:04
2022-07-21 15:48:30 [INFO]	[TRAIN] epoch: 2, iter: 8930/20000, loss: 0.3215, lr: 0.005873, batch_cost: 0.2370, reader_cost: 0.10949, ips: 16.8807 samples/sec | ETA 00:43:43
2022-07-21 15:48:32 [INFO]	[TRAIN] epoch: 2, iter: 8940/20000, loss: 0.3899, lr: 0.005868, batch_cost: 0.2687, reader_cost: 0.11786, ips: 14.8853 samples/sec | ETA 00:49:32
2022-07-21 15:48:35 [INFO]	[TRAIN] epoch: 2, iter: 8950/20000, loss: 0.3870, lr: 0.005863, batch_cost: 0.2755, reader_cost: 0.13071, ips: 14.5207 samples/sec | ETA 00:50:43
2022-07-21 15:48:38 [INFO]	[TRAIN] epoch: 2, iter: 8960/20000, loss: 0.4225, lr: 0.005858, batch_cost: 0.2351, reader_cost: 0.11167, ips: 17.0171 samples/sec | ETA 00:43:15
2022-07-21 15:48:40 [INFO]	[TRAIN] epoch: 2, iter: 8970/20000, loss: 0.3613, lr: 0.005854, batch_cost: 0.2311, reader_cost: 0.10717, ips: 17.3105 samples/sec | ETA 00:42:28
2022-07-21 15:48:42 [INFO]	[TRAIN] epoch: 2, iter: 8980/20000, loss: 0.2971, lr: 0.005849, batch_cost: 0.2455, reader_cost: 0.11656, ips: 16.2912 samples/sec | ETA 00:45:05
2022-07-21 15:48:45 [INFO]	[TRAIN] epoch: 2, iter: 8990/20000, loss: 0.5250, lr: 0.005844, batch_cost: 0.2521, reader_cost: 0.12502, ips: 15.8681 samples/sec | ETA 00:46:15
2022-07-21 15:48:47 [INFO]	[TRAIN] epoch: 2, iter: 9000/20000, loss: 0.4248, lr: 0.005839, batch_cost: 0.2316, reader_cost: 0.10610, ips: 17.2729 samples/sec | ETA 00:42:27
2022-07-21 15:48:50 [INFO]	[TRAIN] epoch: 2, iter: 9010/20000, loss: 0.3118, lr: 0.005835, batch_cost: 0.2553, reader_cost: 0.12845, ips: 15.6706 samples/sec | ETA 00:46:45
2022-07-21 15:48:52 [INFO]	[TRAIN] epoch: 2, iter: 9020/20000, loss: 0.4037, lr: 0.005830, batch_cost: 0.2249, reader_cost: 0.10420, ips: 17.7819 samples/sec | ETA 00:41:09
2022-07-21 15:48:54 [INFO]	[TRAIN] epoch: 2, iter: 9030/20000, loss: 0.3256, lr: 0.005825, batch_cost: 0.2361, reader_cost: 0.11303, ips: 16.9399 samples/sec | ETA 00:43:10
2022-07-21 15:48:57 [INFO]	[TRAIN] epoch: 2, iter: 9040/20000, loss: 0.3372, lr: 0.005820, batch_cost: 0.2265, reader_cost: 0.10272, ips: 17.6593 samples/sec | ETA 00:41:22
2022-07-21 15:48:59 [INFO]	[TRAIN] epoch: 2, iter: 9050/20000, loss: 0.3766, lr: 0.005815, batch_cost: 0.2168, reader_cost: 0.09411, ips: 18.4498 samples/sec | ETA 00:39:34
2022-07-21 15:49:01 [INFO]	[TRAIN] epoch: 2, iter: 9060/20000, loss: 0.3565, lr: 0.005811, batch_cost: 0.2506, reader_cost: 0.12767, ips: 15.9612 samples/sec | ETA 00:45:41
2022-07-21 15:49:04 [INFO]	[TRAIN] epoch: 2, iter: 9070/20000, loss: 0.3560, lr: 0.005806, batch_cost: 0.2670, reader_cost: 0.14238, ips: 14.9817 samples/sec | ETA 00:48:38
2022-07-21 15:49:06 [INFO]	[TRAIN] epoch: 2, iter: 9080/20000, loss: 0.2961, lr: 0.005801, batch_cost: 0.2451, reader_cost: 0.12062, ips: 16.3178 samples/sec | ETA 00:44:36
2022-07-21 15:49:09 [INFO]	[TRAIN] epoch: 2, iter: 9090/20000, loss: 0.2933, lr: 0.005796, batch_cost: 0.2376, reader_cost: 0.11447, ips: 16.8330 samples/sec | ETA 00:43:12
2022-07-21 15:49:11 [INFO]	[TRAIN] epoch: 2, iter: 9100/20000, loss: 0.3168, lr: 0.005792, batch_cost: 0.2478, reader_cost: 0.12324, ips: 16.1412 samples/sec | ETA 00:45:01
2022-07-21 15:49:14 [INFO]	[TRAIN] epoch: 2, iter: 9110/20000, loss: 0.4104, lr: 0.005787, batch_cost: 0.2392, reader_cost: 0.11436, ips: 16.7220 samples/sec | ETA 00:43:24
2022-07-21 15:49:16 [INFO]	[TRAIN] epoch: 2, iter: 9120/20000, loss: 0.3228, lr: 0.005782, batch_cost: 0.2454, reader_cost: 0.12269, ips: 16.3001 samples/sec | ETA 00:44:29
2022-07-21 15:49:18 [INFO]	[TRAIN] epoch: 2, iter: 9130/20000, loss: 0.4526, lr: 0.005777, batch_cost: 0.2445, reader_cost: 0.11609, ips: 16.3603 samples/sec | ETA 00:44:17
2022-07-21 15:49:21 [INFO]	[TRAIN] epoch: 2, iter: 9140/20000, loss: 0.3368, lr: 0.005772, batch_cost: 0.2613, reader_cost: 0.13485, ips: 15.3073 samples/sec | ETA 00:47:17
2022-07-21 15:49:24 [INFO]	[TRAIN] epoch: 2, iter: 9150/20000, loss: 0.4126, lr: 0.005768, batch_cost: 0.2453, reader_cost: 0.11612, ips: 16.3097 samples/sec | ETA 00:44:20
2022-07-21 15:49:26 [INFO]	[TRAIN] epoch: 2, iter: 9160/20000, loss: 0.3478, lr: 0.005763, batch_cost: 0.2270, reader_cost: 0.10271, ips: 17.6245 samples/sec | ETA 00:41:00
2022-07-21 15:49:29 [INFO]	[TRAIN] epoch: 2, iter: 9170/20000, loss: 0.3692, lr: 0.005758, batch_cost: 0.2762, reader_cost: 0.13346, ips: 14.4845 samples/sec | ETA 00:49:50
2022-07-21 15:49:32 [INFO]	[TRAIN] epoch: 2, iter: 9180/20000, loss: 0.3648, lr: 0.005753, batch_cost: 0.3676, reader_cost: 0.17534, ips: 10.8810 samples/sec | ETA 01:06:17
2022-07-21 15:49:35 [INFO]	[TRAIN] epoch: 2, iter: 9190/20000, loss: 0.3475, lr: 0.005748, batch_cost: 0.2755, reader_cost: 0.12560, ips: 14.5174 samples/sec | ETA 00:49:38
2022-07-21 15:49:38 [INFO]	[TRAIN] epoch: 2, iter: 9200/20000, loss: 0.4395, lr: 0.005744, batch_cost: 0.2706, reader_cost: 0.13512, ips: 14.7833 samples/sec | ETA 00:48:42
2022-07-21 15:49:40 [INFO]	[TRAIN] epoch: 2, iter: 9210/20000, loss: 0.3647, lr: 0.005739, batch_cost: 0.2508, reader_cost: 0.12625, ips: 15.9458 samples/sec | ETA 00:45:06
2022-07-21 15:49:43 [INFO]	[TRAIN] epoch: 2, iter: 9220/20000, loss: 0.3713, lr: 0.005734, batch_cost: 0.2344, reader_cost: 0.10845, ips: 17.0652 samples/sec | ETA 00:42:06
2022-07-21 15:49:45 [INFO]	[TRAIN] epoch: 2, iter: 9230/20000, loss: 0.3257, lr: 0.005729, batch_cost: 0.2239, reader_cost: 0.10052, ips: 17.8662 samples/sec | ETA 00:40:11
2022-07-21 15:49:47 [INFO]	[TRAIN] epoch: 2, iter: 9240/20000, loss: 0.3977, lr: 0.005725, batch_cost: 0.2162, reader_cost: 0.09413, ips: 18.5008 samples/sec | ETA 00:38:46
2022-07-21 15:49:49 [INFO]	[TRAIN] epoch: 2, iter: 9250/20000, loss: 0.3130, lr: 0.005720, batch_cost: 0.2389, reader_cost: 0.11616, ips: 16.7427 samples/sec | ETA 00:42:48
2022-07-21 15:49:52 [INFO]	[TRAIN] epoch: 2, iter: 9260/20000, loss: 0.3605, lr: 0.005715, batch_cost: 0.2394, reader_cost: 0.11673, ips: 16.7109 samples/sec | ETA 00:42:50
2022-07-21 15:49:54 [INFO]	[TRAIN] epoch: 2, iter: 9270/20000, loss: 0.3616, lr: 0.005710, batch_cost: 0.2490, reader_cost: 0.12577, ips: 16.0629 samples/sec | ETA 00:44:31
2022-07-21 15:49:57 [INFO]	[TRAIN] epoch: 2, iter: 9280/20000, loss: 0.5049, lr: 0.005705, batch_cost: 0.2452, reader_cost: 0.11809, ips: 16.3111 samples/sec | ETA 00:43:48
2022-07-21 15:49:59 [INFO]	[TRAIN] epoch: 2, iter: 9290/20000, loss: 0.3136, lr: 0.005701, batch_cost: 0.2408, reader_cost: 0.11760, ips: 16.6134 samples/sec | ETA 00:42:58
2022-07-21 15:50:01 [INFO]	[TRAIN] epoch: 2, iter: 9300/20000, loss: 0.3065, lr: 0.005696, batch_cost: 0.2323, reader_cost: 0.10813, ips: 17.2167 samples/sec | ETA 00:41:25
2022-07-21 15:50:04 [INFO]	[TRAIN] epoch: 2, iter: 9310/20000, loss: 0.3286, lr: 0.005691, batch_cost: 0.2347, reader_cost: 0.11183, ips: 17.0432 samples/sec | ETA 00:41:48
2022-07-21 15:50:06 [INFO]	[TRAIN] epoch: 2, iter: 9320/20000, loss: 0.2784, lr: 0.005686, batch_cost: 0.2697, reader_cost: 0.12694, ips: 14.8334 samples/sec | ETA 00:47:59
2022-07-21 15:50:09 [INFO]	[TRAIN] epoch: 2, iter: 9330/20000, loss: 0.3032, lr: 0.005681, batch_cost: 0.2374, reader_cost: 0.11419, ips: 16.8499 samples/sec | ETA 00:42:12
2022-07-21 15:50:11 [INFO]	[TRAIN] epoch: 2, iter: 9340/20000, loss: 0.3179, lr: 0.005677, batch_cost: 0.2362, reader_cost: 0.11396, ips: 16.9381 samples/sec | ETA 00:41:57
2022-07-21 15:50:14 [INFO]	[TRAIN] epoch: 2, iter: 9350/20000, loss: 0.3131, lr: 0.005672, batch_cost: 0.2359, reader_cost: 0.11221, ips: 16.9556 samples/sec | ETA 00:41:52
2022-07-21 15:50:16 [INFO]	[TRAIN] epoch: 2, iter: 9360/20000, loss: 0.3538, lr: 0.005667, batch_cost: 0.2456, reader_cost: 0.12091, ips: 16.2841 samples/sec | ETA 00:43:33
2022-07-21 15:50:18 [INFO]	[TRAIN] epoch: 2, iter: 9370/20000, loss: 0.4056, lr: 0.005662, batch_cost: 0.2277, reader_cost: 0.10336, ips: 17.5696 samples/sec | ETA 00:40:20
2022-07-21 15:50:21 [INFO]	[TRAIN] epoch: 2, iter: 9380/20000, loss: 0.4474, lr: 0.005657, batch_cost: 0.2408, reader_cost: 0.11824, ips: 16.6138 samples/sec | ETA 00:42:36
2022-07-21 15:50:23 [INFO]	[TRAIN] epoch: 2, iter: 9390/20000, loss: 0.3254, lr: 0.005653, batch_cost: 0.2435, reader_cost: 0.11575, ips: 16.4271 samples/sec | ETA 00:43:03
2022-07-21 15:50:26 [INFO]	[TRAIN] epoch: 2, iter: 9400/20000, loss: 0.4234, lr: 0.005648, batch_cost: 0.2457, reader_cost: 0.11927, ips: 16.2787 samples/sec | ETA 00:43:24
2022-07-21 15:50:28 [INFO]	[TRAIN] epoch: 2, iter: 9410/20000, loss: 0.4784, lr: 0.005643, batch_cost: 0.2479, reader_cost: 0.10087, ips: 16.1355 samples/sec | ETA 00:43:45
2022-07-21 15:50:30 [INFO]	[TRAIN] epoch: 2, iter: 9420/20000, loss: 0.3366, lr: 0.005638, batch_cost: 0.2370, reader_cost: 0.09561, ips: 16.8767 samples/sec | ETA 00:41:47
2022-07-21 15:50:33 [INFO]	[TRAIN] epoch: 2, iter: 9430/20000, loss: 0.3297, lr: 0.005633, batch_cost: 0.2407, reader_cost: 0.11481, ips: 16.6201 samples/sec | ETA 00:42:23
2022-07-21 15:50:35 [INFO]	[TRAIN] epoch: 2, iter: 9440/20000, loss: 0.3596, lr: 0.005629, batch_cost: 0.2400, reader_cost: 0.10771, ips: 16.6645 samples/sec | ETA 00:42:14
2022-07-21 15:50:40 [INFO]	[TRAIN] epoch: 2, iter: 9450/20000, loss: 0.3022, lr: 0.005624, batch_cost: 0.4813, reader_cost: 0.24061, ips: 8.3100 samples/sec | ETA 01:24:38
2022-07-21 15:50:43 [INFO]	[TRAIN] epoch: 2, iter: 9460/20000, loss: 0.3640, lr: 0.005619, batch_cost: 0.2802, reader_cost: 0.13757, ips: 14.2760 samples/sec | ETA 00:49:13
2022-07-21 15:50:45 [INFO]	[TRAIN] epoch: 2, iter: 9470/20000, loss: 0.5272, lr: 0.005614, batch_cost: 0.2343, reader_cost: 0.11118, ips: 17.0746 samples/sec | ETA 00:41:06
2022-07-21 15:50:48 [INFO]	[TRAIN] epoch: 2, iter: 9480/20000, loss: 0.3791, lr: 0.005610, batch_cost: 0.2344, reader_cost: 0.11076, ips: 17.0641 samples/sec | ETA 00:41:05
2022-07-21 15:50:50 [INFO]	[TRAIN] epoch: 2, iter: 9490/20000, loss: 0.3625, lr: 0.005605, batch_cost: 0.2301, reader_cost: 0.10060, ips: 17.3806 samples/sec | ETA 00:40:18
2022-07-21 15:50:52 [INFO]	[TRAIN] epoch: 2, iter: 9500/20000, loss: 0.3030, lr: 0.005600, batch_cost: 0.2301, reader_cost: 0.10167, ips: 17.3864 samples/sec | ETA 00:40:15
2022-07-21 15:50:55 [INFO]	[TRAIN] epoch: 2, iter: 9510/20000, loss: 0.4320, lr: 0.005595, batch_cost: 0.2331, reader_cost: 0.10934, ips: 17.1587 samples/sec | ETA 00:40:45
2022-07-21 15:50:57 [INFO]	[TRAIN] epoch: 2, iter: 9520/20000, loss: 0.4028, lr: 0.005590, batch_cost: 0.2372, reader_cost: 0.11497, ips: 16.8659 samples/sec | ETA 00:41:25
2022-07-21 15:50:59 [INFO]	[TRAIN] epoch: 2, iter: 9530/20000, loss: 0.4041, lr: 0.005586, batch_cost: 0.2418, reader_cost: 0.11759, ips: 16.5450 samples/sec | ETA 00:42:11
2022-07-21 15:51:02 [INFO]	[TRAIN] epoch: 2, iter: 9540/20000, loss: 0.3686, lr: 0.005581, batch_cost: 0.2354, reader_cost: 0.10995, ips: 16.9915 samples/sec | ETA 00:41:02
2022-07-21 15:51:04 [INFO]	[TRAIN] epoch: 2, iter: 9550/20000, loss: 0.3841, lr: 0.005576, batch_cost: 0.2310, reader_cost: 0.10705, ips: 17.3196 samples/sec | ETA 00:40:13
2022-07-21 15:51:06 [INFO]	[TRAIN] epoch: 2, iter: 9560/20000, loss: 0.3477, lr: 0.005571, batch_cost: 0.2485, reader_cost: 0.12605, ips: 16.0950 samples/sec | ETA 00:43:14
2022-07-21 15:51:09 [INFO]	[TRAIN] epoch: 2, iter: 9570/20000, loss: 0.3367, lr: 0.005566, batch_cost: 0.2214, reader_cost: 0.09981, ips: 18.0638 samples/sec | ETA 00:38:29
2022-07-21 15:51:11 [INFO]	[TRAIN] epoch: 2, iter: 9580/20000, loss: 0.2829, lr: 0.005561, batch_cost: 0.2722, reader_cost: 0.12292, ips: 14.6926 samples/sec | ETA 00:47:16
2022-07-21 15:51:14 [INFO]	[TRAIN] epoch: 2, iter: 9590/20000, loss: 0.3803, lr: 0.005557, batch_cost: 0.2274, reader_cost: 0.10367, ips: 17.5907 samples/sec | ETA 00:39:27
2022-07-21 15:51:16 [INFO]	[TRAIN] epoch: 2, iter: 9600/20000, loss: 0.3151, lr: 0.005552, batch_cost: 0.2375, reader_cost: 0.11134, ips: 16.8441 samples/sec | ETA 00:41:09
2022-07-21 15:51:18 [INFO]	[TRAIN] epoch: 2, iter: 9610/20000, loss: 0.3803, lr: 0.005547, batch_cost: 0.2342, reader_cost: 0.11246, ips: 17.0824 samples/sec | ETA 00:40:32
2022-07-21 15:51:21 [INFO]	[TRAIN] epoch: 2, iter: 9620/20000, loss: 0.4055, lr: 0.005542, batch_cost: 0.2347, reader_cost: 0.11084, ips: 17.0450 samples/sec | ETA 00:40:35
2022-07-21 15:51:23 [INFO]	[TRAIN] epoch: 2, iter: 9630/20000, loss: 0.3595, lr: 0.005537, batch_cost: 0.2347, reader_cost: 0.10829, ips: 17.0422 samples/sec | ETA 00:40:33
2022-07-21 15:51:26 [INFO]	[TRAIN] epoch: 2, iter: 9640/20000, loss: 0.3313, lr: 0.005533, batch_cost: 0.2593, reader_cost: 0.13146, ips: 15.4273 samples/sec | ETA 00:44:46
2022-07-21 15:51:28 [INFO]	[TRAIN] epoch: 2, iter: 9650/20000, loss: 0.3234, lr: 0.005528, batch_cost: 0.2241, reader_cost: 0.10134, ips: 17.8528 samples/sec | ETA 00:38:38
2022-07-21 15:51:30 [INFO]	[TRAIN] epoch: 2, iter: 9660/20000, loss: 0.2805, lr: 0.005523, batch_cost: 0.2413, reader_cost: 0.11405, ips: 16.5764 samples/sec | ETA 00:41:35
2022-07-21 15:51:33 [INFO]	[TRAIN] epoch: 2, iter: 9670/20000, loss: 0.3706, lr: 0.005518, batch_cost: 0.2376, reader_cost: 0.10820, ips: 16.8353 samples/sec | ETA 00:40:54
2022-07-21 15:51:35 [INFO]	[TRAIN] epoch: 2, iter: 9680/20000, loss: 0.3469, lr: 0.005513, batch_cost: 0.2386, reader_cost: 0.11377, ips: 16.7613 samples/sec | ETA 00:41:02
2022-07-21 15:51:37 [INFO]	[TRAIN] epoch: 2, iter: 9690/20000, loss: 0.3864, lr: 0.005509, batch_cost: 0.2255, reader_cost: 0.09816, ips: 17.7407 samples/sec | ETA 00:38:44
2022-07-21 15:51:40 [INFO]	[TRAIN] epoch: 2, iter: 9700/20000, loss: 0.2648, lr: 0.005504, batch_cost: 0.2294, reader_cost: 0.10375, ips: 17.4368 samples/sec | ETA 00:39:22
2022-07-21 15:51:43 [INFO]	[TRAIN] epoch: 2, iter: 9710/20000, loss: 0.3084, lr: 0.005499, batch_cost: 0.2924, reader_cost: 0.14242, ips: 13.6820 samples/sec | ETA 00:50:08
2022-07-21 15:51:47 [INFO]	[TRAIN] epoch: 2, iter: 9720/20000, loss: 0.2482, lr: 0.005494, batch_cost: 0.4534, reader_cost: 0.24070, ips: 8.8216 samples/sec | ETA 01:17:41
2022-07-21 15:51:50 [INFO]	[TRAIN] epoch: 2, iter: 9730/20000, loss: 0.3241, lr: 0.005489, batch_cost: 0.2865, reader_cost: 0.12204, ips: 13.9601 samples/sec | ETA 00:49:02
2022-07-21 15:51:52 [INFO]	[TRAIN] epoch: 2, iter: 9740/20000, loss: 0.3215, lr: 0.005485, batch_cost: 0.2376, reader_cost: 0.10781, ips: 16.8354 samples/sec | ETA 00:40:37
2022-07-21 15:51:55 [INFO]	[TRAIN] epoch: 2, iter: 9750/20000, loss: 0.3278, lr: 0.005480, batch_cost: 0.2538, reader_cost: 0.13096, ips: 15.7593 samples/sec | ETA 00:43:21
2022-07-21 15:51:57 [INFO]	[TRAIN] epoch: 2, iter: 9760/20000, loss: 0.3433, lr: 0.005475, batch_cost: 0.2389, reader_cost: 0.11304, ips: 16.7427 samples/sec | ETA 00:40:46
2022-07-21 15:52:00 [INFO]	[TRAIN] epoch: 2, iter: 9770/20000, loss: 0.3239, lr: 0.005470, batch_cost: 0.2266, reader_cost: 0.10443, ips: 17.6500 samples/sec | ETA 00:38:38
2022-07-21 15:52:02 [INFO]	[TRAIN] epoch: 2, iter: 9780/20000, loss: 0.3441, lr: 0.005465, batch_cost: 0.2384, reader_cost: 0.11380, ips: 16.7755 samples/sec | ETA 00:40:36
2022-07-21 15:52:04 [INFO]	[TRAIN] epoch: 2, iter: 9790/20000, loss: 0.3238, lr: 0.005461, batch_cost: 0.2428, reader_cost: 0.12142, ips: 16.4738 samples/sec | ETA 00:41:19
2022-07-21 15:52:07 [INFO]	[TRAIN] epoch: 2, iter: 9800/20000, loss: 0.4424, lr: 0.005456, batch_cost: 0.2271, reader_cost: 0.10291, ips: 17.6137 samples/sec | ETA 00:38:36
2022-07-21 15:52:09 [INFO]	[TRAIN] epoch: 2, iter: 9810/20000, loss: 0.4047, lr: 0.005451, batch_cost: 0.2239, reader_cost: 0.10091, ips: 17.8667 samples/sec | ETA 00:38:01
2022-07-21 15:52:11 [INFO]	[TRAIN] epoch: 2, iter: 9820/20000, loss: 0.2742, lr: 0.005446, batch_cost: 0.2292, reader_cost: 0.10405, ips: 17.4501 samples/sec | ETA 00:38:53
2022-07-21 15:52:14 [INFO]	[TRAIN] epoch: 2, iter: 9830/20000, loss: 0.4473, lr: 0.005441, batch_cost: 0.2813, reader_cost: 0.13170, ips: 14.2214 samples/sec | ETA 00:47:40
2022-07-21 15:52:16 [INFO]	[TRAIN] epoch: 2, iter: 9840/20000, loss: 0.2918, lr: 0.005436, batch_cost: 0.2409, reader_cost: 0.11454, ips: 16.6021 samples/sec | ETA 00:40:47
2022-07-21 15:52:19 [INFO]	[TRAIN] epoch: 2, iter: 9850/20000, loss: 0.3203, lr: 0.005432, batch_cost: 0.2265, reader_cost: 0.10607, ips: 17.6608 samples/sec | ETA 00:38:18
2022-07-21 15:52:21 [INFO]	[TRAIN] epoch: 2, iter: 9860/20000, loss: 0.2830, lr: 0.005427, batch_cost: 0.2316, reader_cost: 0.10870, ips: 17.2721 samples/sec | ETA 00:39:08
2022-07-21 15:52:23 [INFO]	[TRAIN] epoch: 2, iter: 9870/20000, loss: 0.4303, lr: 0.005422, batch_cost: 0.2382, reader_cost: 0.11487, ips: 16.7905 samples/sec | ETA 00:40:13
2022-07-21 15:52:26 [INFO]	[TRAIN] epoch: 2, iter: 9880/20000, loss: 0.3192, lr: 0.005417, batch_cost: 0.2230, reader_cost: 0.09983, ips: 17.9351 samples/sec | ETA 00:37:37
2022-07-21 15:52:28 [INFO]	[TRAIN] epoch: 2, iter: 9890/20000, loss: 0.3606, lr: 0.005412, batch_cost: 0.2333, reader_cost: 0.11001, ips: 17.1450 samples/sec | ETA 00:39:18
2022-07-21 15:52:30 [INFO]	[TRAIN] epoch: 2, iter: 9900/20000, loss: 0.4961, lr: 0.005408, batch_cost: 0.2359, reader_cost: 0.11176, ips: 16.9560 samples/sec | ETA 00:39:42
2022-07-21 15:52:33 [INFO]	[TRAIN] epoch: 3, iter: 9910/20000, loss: 0.3578, lr: 0.005403, batch_cost: 0.2364, reader_cost: 0.11463, ips: 16.9240 samples/sec | ETA 00:39:44
2022-07-21 15:52:35 [INFO]	[TRAIN] epoch: 3, iter: 9920/20000, loss: 0.3289, lr: 0.005398, batch_cost: 0.2442, reader_cost: 0.11116, ips: 16.3774 samples/sec | ETA 00:41:01
2022-07-21 15:52:38 [INFO]	[TRAIN] epoch: 3, iter: 9930/20000, loss: 0.3882, lr: 0.005393, batch_cost: 0.2397, reader_cost: 0.11101, ips: 16.6864 samples/sec | ETA 00:40:13
2022-07-21 15:52:40 [INFO]	[TRAIN] epoch: 3, iter: 9940/20000, loss: 0.4135, lr: 0.005388, batch_cost: 0.2484, reader_cost: 0.12390, ips: 16.1000 samples/sec | ETA 00:41:39
2022-07-21 15:52:42 [INFO]	[TRAIN] epoch: 3, iter: 9950/20000, loss: 0.2918, lr: 0.005383, batch_cost: 0.2357, reader_cost: 0.10805, ips: 16.9740 samples/sec | ETA 00:39:28
2022-07-21 15:52:45 [INFO]	[TRAIN] epoch: 3, iter: 9960/20000, loss: 0.3873, lr: 0.005379, batch_cost: 0.2779, reader_cost: 0.14092, ips: 14.3958 samples/sec | ETA 00:46:29
2022-07-21 15:52:48 [INFO]	[TRAIN] epoch: 3, iter: 9970/20000, loss: 0.2681, lr: 0.005374, batch_cost: 0.3078, reader_cost: 0.15138, ips: 12.9947 samples/sec | ETA 00:51:27
2022-07-21 15:52:51 [INFO]	[TRAIN] epoch: 3, iter: 9980/20000, loss: 0.3009, lr: 0.005369, batch_cost: 0.2595, reader_cost: 0.12844, ips: 15.4138 samples/sec | ETA 00:43:20
2022-07-21 15:52:54 [INFO]	[TRAIN] epoch: 3, iter: 9990/20000, loss: 0.3348, lr: 0.005364, batch_cost: 0.3607, reader_cost: 0.17451, ips: 11.0892 samples/sec | ETA 01:00:10
2022-07-21 15:52:57 [INFO]	[TRAIN] epoch: 3, iter: 10000/20000, loss: 0.3239, lr: 0.005359, batch_cost: 0.2901, reader_cost: 0.15077, ips: 13.7867 samples/sec | ETA 00:48:21
2022-07-21 15:53:00 [INFO]	[TRAIN] epoch: 3, iter: 10010/20000, loss: 0.2764, lr: 0.005355, batch_cost: 0.2375, reader_cost: 0.11299, ips: 16.8409 samples/sec | ETA 00:39:32
2022-07-21 15:53:02 [INFO]	[TRAIN] epoch: 3, iter: 10020/20000, loss: 0.3123, lr: 0.005350, batch_cost: 0.2486, reader_cost: 0.12681, ips: 16.0912 samples/sec | ETA 00:41:20
2022-07-21 15:53:05 [INFO]	[TRAIN] epoch: 3, iter: 10030/20000, loss: 0.4088, lr: 0.005345, batch_cost: 0.2389, reader_cost: 0.11469, ips: 16.7464 samples/sec | ETA 00:39:41
2022-07-21 15:53:07 [INFO]	[TRAIN] epoch: 3, iter: 10040/20000, loss: 0.4949, lr: 0.005340, batch_cost: 0.2340, reader_cost: 0.11082, ips: 17.0952 samples/sec | ETA 00:38:50
2022-07-21 15:53:09 [INFO]	[TRAIN] epoch: 3, iter: 10050/20000, loss: 0.3278, lr: 0.005335, batch_cost: 0.2228, reader_cost: 0.10070, ips: 17.9558 samples/sec | ETA 00:36:56
2022-07-21 15:53:12 [INFO]	[TRAIN] epoch: 3, iter: 10060/20000, loss: 0.4127, lr: 0.005330, batch_cost: 0.2446, reader_cost: 0.11689, ips: 16.3563 samples/sec | ETA 00:40:30
2022-07-21 15:53:14 [INFO]	[TRAIN] epoch: 3, iter: 10070/20000, loss: 0.4148, lr: 0.005326, batch_cost: 0.2330, reader_cost: 0.10767, ips: 17.1692 samples/sec | ETA 00:38:33
2022-07-21 15:53:16 [INFO]	[TRAIN] epoch: 3, iter: 10080/20000, loss: 0.3814, lr: 0.005321, batch_cost: 0.2506, reader_cost: 0.11859, ips: 15.9607 samples/sec | ETA 00:41:26
2022-07-21 15:53:19 [INFO]	[TRAIN] epoch: 3, iter: 10090/20000, loss: 0.4552, lr: 0.005316, batch_cost: 0.2461, reader_cost: 0.11514, ips: 16.2538 samples/sec | ETA 00:40:38
2022-07-21 15:53:21 [INFO]	[TRAIN] epoch: 3, iter: 10100/20000, loss: 0.3444, lr: 0.005311, batch_cost: 0.2309, reader_cost: 0.10600, ips: 17.3266 samples/sec | ETA 00:38:05
2022-07-21 15:53:24 [INFO]	[TRAIN] epoch: 3, iter: 10110/20000, loss: 0.3150, lr: 0.005306, batch_cost: 0.2322, reader_cost: 0.10636, ips: 17.2276 samples/sec | ETA 00:38:16
2022-07-21 15:53:26 [INFO]	[TRAIN] epoch: 3, iter: 10120/20000, loss: 0.3417, lr: 0.005301, batch_cost: 0.2222, reader_cost: 0.09689, ips: 18.0056 samples/sec | ETA 00:36:34
2022-07-21 15:53:28 [INFO]	[TRAIN] epoch: 3, iter: 10130/20000, loss: 0.3496, lr: 0.005297, batch_cost: 0.2202, reader_cost: 0.09236, ips: 18.1683 samples/sec | ETA 00:36:13
2022-07-21 15:53:30 [INFO]	[TRAIN] epoch: 3, iter: 10140/20000, loss: 0.3413, lr: 0.005292, batch_cost: 0.2240, reader_cost: 0.09447, ips: 17.8580 samples/sec | ETA 00:36:48
2022-07-21 15:53:33 [INFO]	[TRAIN] epoch: 3, iter: 10150/20000, loss: 0.2704, lr: 0.005287, batch_cost: 0.2354, reader_cost: 0.10732, ips: 16.9914 samples/sec | ETA 00:38:38
2022-07-21 15:53:35 [INFO]	[TRAIN] epoch: 3, iter: 10160/20000, loss: 0.3593, lr: 0.005282, batch_cost: 0.2290, reader_cost: 0.10392, ips: 17.4683 samples/sec | ETA 00:37:33
2022-07-21 15:53:37 [INFO]	[TRAIN] epoch: 3, iter: 10170/20000, loss: 0.3323, lr: 0.005277, batch_cost: 0.2395, reader_cost: 0.11741, ips: 16.7047 samples/sec | ETA 00:39:13
2022-07-21 15:53:39 [INFO]	[TRAIN] epoch: 3, iter: 10180/20000, loss: 0.4802, lr: 0.005272, batch_cost: 0.2095, reader_cost: 0.08479, ips: 19.0976 samples/sec | ETA 00:34:16
2022-07-21 15:53:42 [INFO]	[TRAIN] epoch: 3, iter: 10190/20000, loss: 0.4059, lr: 0.005268, batch_cost: 0.2396, reader_cost: 0.10577, ips: 16.6920 samples/sec | ETA 00:39:10
2022-07-21 15:53:44 [INFO]	[TRAIN] epoch: 3, iter: 10200/20000, loss: 0.4751, lr: 0.005263, batch_cost: 0.2310, reader_cost: 0.10209, ips: 17.3184 samples/sec | ETA 00:37:43
2022-07-21 15:53:46 [INFO]	[TRAIN] epoch: 3, iter: 10210/20000, loss: 0.3373, lr: 0.005258, batch_cost: 0.2412, reader_cost: 0.11351, ips: 16.5824 samples/sec | ETA 00:39:21
2022-07-21 15:53:49 [INFO]	[TRAIN] epoch: 3, iter: 10220/20000, loss: 0.3796, lr: 0.005253, batch_cost: 0.2572, reader_cost: 0.10753, ips: 15.5544 samples/sec | ETA 00:41:55
2022-07-21 15:53:52 [INFO]	[TRAIN] epoch: 3, iter: 10230/20000, loss: 0.2732, lr: 0.005248, batch_cost: 0.2907, reader_cost: 0.14148, ips: 13.7601 samples/sec | ETA 00:47:20
2022-07-21 15:53:54 [INFO]	[TRAIN] epoch: 3, iter: 10240/20000, loss: 0.4746, lr: 0.005243, batch_cost: 0.2260, reader_cost: 0.10285, ips: 17.6980 samples/sec | ETA 00:36:45
2022-07-21 15:53:57 [INFO]	[TRAIN] epoch: 3, iter: 10250/20000, loss: 0.3503, lr: 0.005239, batch_cost: 0.2385, reader_cost: 0.10945, ips: 16.7731 samples/sec | ETA 00:38:45
2022-07-21 15:54:00 [INFO]	[TRAIN] epoch: 3, iter: 10260/20000, loss: 0.3372, lr: 0.005234, batch_cost: 0.2941, reader_cost: 0.14212, ips: 13.6023 samples/sec | ETA 00:47:44
2022-07-21 15:54:03 [INFO]	[TRAIN] epoch: 3, iter: 10270/20000, loss: 0.3882, lr: 0.005229, batch_cost: 0.3358, reader_cost: 0.15928, ips: 11.9105 samples/sec | ETA 00:54:27
2022-07-21 15:54:05 [INFO]	[TRAIN] epoch: 3, iter: 10280/20000, loss: 0.2789, lr: 0.005224, batch_cost: 0.2406, reader_cost: 0.11797, ips: 16.6285 samples/sec | ETA 00:38:58
2022-07-21 15:54:08 [INFO]	[TRAIN] epoch: 3, iter: 10290/20000, loss: 0.5647, lr: 0.005219, batch_cost: 0.2501, reader_cost: 0.12470, ips: 15.9940 samples/sec | ETA 00:40:28
2022-07-21 15:54:10 [INFO]	[TRAIN] epoch: 3, iter: 10300/20000, loss: 0.3400, lr: 0.005214, batch_cost: 0.2398, reader_cost: 0.11780, ips: 16.6792 samples/sec | ETA 00:38:46
2022-07-21 15:54:12 [INFO]	[TRAIN] epoch: 3, iter: 10310/20000, loss: 0.3769, lr: 0.005210, batch_cost: 0.2310, reader_cost: 0.10378, ips: 17.3141 samples/sec | ETA 00:37:18
2022-07-21 15:54:15 [INFO]	[TRAIN] epoch: 3, iter: 10320/20000, loss: 0.3506, lr: 0.005205, batch_cost: 0.2356, reader_cost: 0.10864, ips: 16.9767 samples/sec | ETA 00:38:00
2022-07-21 15:54:17 [INFO]	[TRAIN] epoch: 3, iter: 10330/20000, loss: 0.3014, lr: 0.005200, batch_cost: 0.2316, reader_cost: 0.10440, ips: 17.2733 samples/sec | ETA 00:37:19
2022-07-21 15:54:20 [INFO]	[TRAIN] epoch: 3, iter: 10340/20000, loss: 0.2694, lr: 0.005195, batch_cost: 0.2560, reader_cost: 0.11977, ips: 15.6275 samples/sec | ETA 00:41:12
2022-07-21 15:54:22 [INFO]	[TRAIN] epoch: 3, iter: 10350/20000, loss: 0.4047, lr: 0.005190, batch_cost: 0.2604, reader_cost: 0.11892, ips: 15.3624 samples/sec | ETA 00:41:52
2022-07-21 15:54:25 [INFO]	[TRAIN] epoch: 3, iter: 10360/20000, loss: 0.2904, lr: 0.005185, batch_cost: 0.2414, reader_cost: 0.11777, ips: 16.5733 samples/sec | ETA 00:38:46
2022-07-21 15:54:27 [INFO]	[TRAIN] epoch: 3, iter: 10370/20000, loss: 0.3791, lr: 0.005181, batch_cost: 0.2288, reader_cost: 0.10654, ips: 17.4815 samples/sec | ETA 00:36:43
2022-07-21 15:54:29 [INFO]	[TRAIN] epoch: 3, iter: 10380/20000, loss: 0.3920, lr: 0.005176, batch_cost: 0.2410, reader_cost: 0.11813, ips: 16.5955 samples/sec | ETA 00:38:38
2022-07-21 15:54:32 [INFO]	[TRAIN] epoch: 3, iter: 10390/20000, loss: 0.3336, lr: 0.005171, batch_cost: 0.2230, reader_cost: 0.09472, ips: 17.9339 samples/sec | ETA 00:35:43
2022-07-21 15:54:34 [INFO]	[TRAIN] epoch: 3, iter: 10400/20000, loss: 0.3088, lr: 0.005166, batch_cost: 0.2232, reader_cost: 0.09979, ips: 17.9245 samples/sec | ETA 00:35:42
2022-07-21 15:54:36 [INFO]	[TRAIN] epoch: 3, iter: 10410/20000, loss: 0.3351, lr: 0.005161, batch_cost: 0.2323, reader_cost: 0.10919, ips: 17.2224 samples/sec | ETA 00:37:07
2022-07-21 15:54:39 [INFO]	[TRAIN] epoch: 3, iter: 10420/20000, loss: 0.3815, lr: 0.005156, batch_cost: 0.2383, reader_cost: 0.11296, ips: 16.7848 samples/sec | ETA 00:38:03
2022-07-21 15:54:41 [INFO]	[TRAIN] epoch: 3, iter: 10430/20000, loss: 0.2595, lr: 0.005152, batch_cost: 0.2350, reader_cost: 0.11235, ips: 17.0211 samples/sec | ETA 00:37:28
2022-07-21 15:54:43 [INFO]	[TRAIN] epoch: 3, iter: 10440/20000, loss: 0.3088, lr: 0.005147, batch_cost: 0.2350, reader_cost: 0.11090, ips: 17.0202 samples/sec | ETA 00:37:26
2022-07-21 15:54:46 [INFO]	[TRAIN] epoch: 3, iter: 10450/20000, loss: 0.4280, lr: 0.005142, batch_cost: 0.2510, reader_cost: 0.12292, ips: 15.9372 samples/sec | ETA 00:39:56
2022-07-21 15:54:48 [INFO]	[TRAIN] epoch: 3, iter: 10460/20000, loss: 0.3732, lr: 0.005137, batch_cost: 0.2471, reader_cost: 0.12032, ips: 16.1857 samples/sec | ETA 00:39:17
2022-07-21 15:54:51 [INFO]	[TRAIN] epoch: 3, iter: 10470/20000, loss: 0.4362, lr: 0.005132, batch_cost: 0.2267, reader_cost: 0.10085, ips: 17.6419 samples/sec | ETA 00:36:00
2022-07-21 15:54:54 [INFO]	[TRAIN] epoch: 3, iter: 10480/20000, loss: 0.3784, lr: 0.005127, batch_cost: 0.3331, reader_cost: 0.17735, ips: 12.0078 samples/sec | ETA 00:52:51
2022-07-21 15:54:57 [INFO]	[TRAIN] epoch: 3, iter: 10490/20000, loss: 0.4448, lr: 0.005122, batch_cost: 0.2652, reader_cost: 0.13542, ips: 15.0831 samples/sec | ETA 00:42:02
2022-07-21 15:54:59 [INFO]	[TRAIN] epoch: 3, iter: 10500/20000, loss: 0.4418, lr: 0.005118, batch_cost: 0.2290, reader_cost: 0.10606, ips: 17.4706 samples/sec | ETA 00:36:15
2022-07-21 15:55:01 [INFO]	[TRAIN] epoch: 3, iter: 10510/20000, loss: 0.5078, lr: 0.005113, batch_cost: 0.2308, reader_cost: 0.10844, ips: 17.3320 samples/sec | ETA 00:36:30
2022-07-21 15:55:04 [INFO]	[TRAIN] epoch: 3, iter: 10520/20000, loss: 0.2750, lr: 0.005108, batch_cost: 0.2409, reader_cost: 0.11809, ips: 16.6065 samples/sec | ETA 00:38:03
2022-07-21 15:55:06 [INFO]	[TRAIN] epoch: 3, iter: 10530/20000, loss: 0.4096, lr: 0.005103, batch_cost: 0.2558, reader_cost: 0.11244, ips: 15.6394 samples/sec | ETA 00:40:22
2022-07-21 15:55:10 [INFO]	[TRAIN] epoch: 3, iter: 10540/20000, loss: 0.4072, lr: 0.005098, batch_cost: 0.3609, reader_cost: 0.15664, ips: 11.0831 samples/sec | ETA 00:56:54
2022-07-21 15:55:12 [INFO]	[TRAIN] epoch: 3, iter: 10550/20000, loss: 0.3686, lr: 0.005093, batch_cost: 0.2410, reader_cost: 0.11334, ips: 16.6008 samples/sec | ETA 00:37:57
2022-07-21 15:55:14 [INFO]	[TRAIN] epoch: 3, iter: 10560/20000, loss: 0.4262, lr: 0.005088, batch_cost: 0.2265, reader_cost: 0.09857, ips: 17.6614 samples/sec | ETA 00:35:37
2022-07-21 15:55:17 [INFO]	[TRAIN] epoch: 3, iter: 10570/20000, loss: 0.2958, lr: 0.005084, batch_cost: 0.2283, reader_cost: 0.10472, ips: 17.5177 samples/sec | ETA 00:35:53
2022-07-21 15:55:19 [INFO]	[TRAIN] epoch: 3, iter: 10580/20000, loss: 0.3897, lr: 0.005079, batch_cost: 0.2086, reader_cost: 0.08689, ips: 19.1788 samples/sec | ETA 00:32:44
2022-07-21 15:55:21 [INFO]	[TRAIN] epoch: 3, iter: 10590/20000, loss: 0.3123, lr: 0.005074, batch_cost: 0.2475, reader_cost: 0.12526, ips: 16.1601 samples/sec | ETA 00:38:49
2022-07-21 15:55:24 [INFO]	[TRAIN] epoch: 3, iter: 10600/20000, loss: 0.4368, lr: 0.005069, batch_cost: 0.2748, reader_cost: 0.13262, ips: 14.5583 samples/sec | ETA 00:43:02
2022-07-21 15:55:26 [INFO]	[TRAIN] epoch: 3, iter: 10610/20000, loss: 0.3426, lr: 0.005064, batch_cost: 0.2439, reader_cost: 0.11943, ips: 16.4035 samples/sec | ETA 00:38:09
2022-07-21 15:55:29 [INFO]	[TRAIN] epoch: 3, iter: 10620/20000, loss: 0.3202, lr: 0.005059, batch_cost: 0.2388, reader_cost: 0.11636, ips: 16.7510 samples/sec | ETA 00:37:19
2022-07-21 15:55:31 [INFO]	[TRAIN] epoch: 3, iter: 10630/20000, loss: 0.2912, lr: 0.005055, batch_cost: 0.2378, reader_cost: 0.11496, ips: 16.8231 samples/sec | ETA 00:37:07
2022-07-21 15:55:34 [INFO]	[TRAIN] epoch: 3, iter: 10640/20000, loss: 0.3233, lr: 0.005050, batch_cost: 0.2386, reader_cost: 0.11546, ips: 16.7642 samples/sec | ETA 00:37:13
2022-07-21 15:55:36 [INFO]	[TRAIN] epoch: 3, iter: 10650/20000, loss: 0.3079, lr: 0.005045, batch_cost: 0.2278, reader_cost: 0.10596, ips: 17.5566 samples/sec | ETA 00:35:30
2022-07-21 15:55:38 [INFO]	[TRAIN] epoch: 3, iter: 10660/20000, loss: 0.3593, lr: 0.005040, batch_cost: 0.2401, reader_cost: 0.11651, ips: 16.6612 samples/sec | ETA 00:37:22
2022-07-21 15:55:41 [INFO]	[TRAIN] epoch: 3, iter: 10670/20000, loss: 0.4755, lr: 0.005035, batch_cost: 0.2249, reader_cost: 0.10180, ips: 17.7819 samples/sec | ETA 00:34:58
2022-07-21 15:55:43 [INFO]	[TRAIN] epoch: 3, iter: 10680/20000, loss: 0.4357, lr: 0.005030, batch_cost: 0.2271, reader_cost: 0.09812, ips: 17.6121 samples/sec | ETA 00:35:16
2022-07-21 15:55:45 [INFO]	[TRAIN] epoch: 3, iter: 10690/20000, loss: 0.3285, lr: 0.005025, batch_cost: 0.2585, reader_cost: 0.12330, ips: 15.4711 samples/sec | ETA 00:40:07
2022-07-21 15:55:48 [INFO]	[TRAIN] epoch: 3, iter: 10700/20000, loss: 0.3483, lr: 0.005021, batch_cost: 0.2360, reader_cost: 0.10595, ips: 16.9480 samples/sec | ETA 00:36:34
2022-07-21 15:55:50 [INFO]	[TRAIN] epoch: 3, iter: 10710/20000, loss: 0.3591, lr: 0.005016, batch_cost: 0.2230, reader_cost: 0.09556, ips: 17.9350 samples/sec | ETA 00:34:31
2022-07-21 15:55:52 [INFO]	[TRAIN] epoch: 3, iter: 10720/20000, loss: 0.3940, lr: 0.005011, batch_cost: 0.2351, reader_cost: 0.10884, ips: 17.0162 samples/sec | ETA 00:36:21
2022-07-21 15:55:55 [INFO]	[TRAIN] epoch: 3, iter: 10730/20000, loss: 0.2967, lr: 0.005006, batch_cost: 0.2581, reader_cost: 0.09984, ips: 15.4986 samples/sec | ETA 00:39:52
2022-07-21 15:55:58 [INFO]	[TRAIN] epoch: 3, iter: 10740/20000, loss: 0.4202, lr: 0.005001, batch_cost: 0.3204, reader_cost: 0.14373, ips: 12.4847 samples/sec | ETA 00:49:26
2022-07-21 15:56:00 [INFO]	[TRAIN] epoch: 3, iter: 10750/20000, loss: 0.3601, lr: 0.004996, batch_cost: 0.2311, reader_cost: 0.10619, ips: 17.3072 samples/sec | ETA 00:35:37
2022-07-21 15:56:03 [INFO]	[TRAIN] epoch: 3, iter: 10760/20000, loss: 0.3168, lr: 0.004991, batch_cost: 0.2322, reader_cost: 0.10846, ips: 17.2287 samples/sec | ETA 00:35:45
2022-07-21 15:56:05 [INFO]	[TRAIN] epoch: 3, iter: 10770/20000, loss: 0.2984, lr: 0.004987, batch_cost: 0.2201, reader_cost: 0.09724, ips: 18.1719 samples/sec | ETA 00:33:51
2022-07-21 15:56:07 [INFO]	[TRAIN] epoch: 3, iter: 10780/20000, loss: 0.4584, lr: 0.004982, batch_cost: 0.2130, reader_cost: 0.08798, ips: 18.7773 samples/sec | ETA 00:32:44
2022-07-21 15:56:09 [INFO]	[TRAIN] epoch: 3, iter: 10790/20000, loss: 0.4133, lr: 0.004977, batch_cost: 0.2290, reader_cost: 0.10402, ips: 17.4689 samples/sec | ETA 00:35:08
2022-07-21 15:56:12 [INFO]	[TRAIN] epoch: 3, iter: 10800/20000, loss: 0.3553, lr: 0.004972, batch_cost: 0.2545, reader_cost: 0.12877, ips: 15.7181 samples/sec | ETA 00:39:01
2022-07-21 15:56:15 [INFO]	[TRAIN] epoch: 3, iter: 10810/20000, loss: 0.4532, lr: 0.004967, batch_cost: 0.3258, reader_cost: 0.16190, ips: 12.2783 samples/sec | ETA 00:49:53
2022-07-21 15:56:18 [INFO]	[TRAIN] epoch: 3, iter: 10820/20000, loss: 0.5089, lr: 0.004962, batch_cost: 0.3150, reader_cost: 0.14615, ips: 12.6987 samples/sec | ETA 00:48:11
2022-07-21 15:56:21 [INFO]	[TRAIN] epoch: 3, iter: 10830/20000, loss: 0.3970, lr: 0.004957, batch_cost: 0.2382, reader_cost: 0.11346, ips: 16.7921 samples/sec | ETA 00:36:24
2022-07-21 15:56:23 [INFO]	[TRAIN] epoch: 3, iter: 10840/20000, loss: 0.4134, lr: 0.004952, batch_cost: 0.2375, reader_cost: 0.11493, ips: 16.8393 samples/sec | ETA 00:36:15
2022-07-21 15:56:26 [INFO]	[TRAIN] epoch: 3, iter: 10850/20000, loss: 0.3937, lr: 0.004948, batch_cost: 0.2406, reader_cost: 0.11623, ips: 16.6255 samples/sec | ETA 00:36:41
2022-07-21 15:56:28 [INFO]	[TRAIN] epoch: 3, iter: 10860/20000, loss: 0.2770, lr: 0.004943, batch_cost: 0.2811, reader_cost: 0.13504, ips: 14.2323 samples/sec | ETA 00:42:48
2022-07-21 15:56:31 [INFO]	[TRAIN] epoch: 3, iter: 10870/20000, loss: 0.3401, lr: 0.004938, batch_cost: 0.2472, reader_cost: 0.12098, ips: 16.1807 samples/sec | ETA 00:37:37
2022-07-21 15:56:33 [INFO]	[TRAIN] epoch: 3, iter: 10880/20000, loss: 0.3897, lr: 0.004933, batch_cost: 0.2374, reader_cost: 0.11579, ips: 16.8493 samples/sec | ETA 00:36:05
2022-07-21 15:56:36 [INFO]	[TRAIN] epoch: 3, iter: 10890/20000, loss: 0.3537, lr: 0.004928, batch_cost: 0.2343, reader_cost: 0.11153, ips: 17.0733 samples/sec | ETA 00:35:34
2022-07-21 15:56:38 [INFO]	[TRAIN] epoch: 3, iter: 10900/20000, loss: 0.2812, lr: 0.004923, batch_cost: 0.2284, reader_cost: 0.10619, ips: 17.5164 samples/sec | ETA 00:34:38
2022-07-21 15:56:40 [INFO]	[TRAIN] epoch: 3, iter: 10910/20000, loss: 0.2629, lr: 0.004918, batch_cost: 0.2351, reader_cost: 0.11163, ips: 17.0144 samples/sec | ETA 00:35:37
2022-07-21 15:56:43 [INFO]	[TRAIN] epoch: 3, iter: 10920/20000, loss: 0.3870, lr: 0.004914, batch_cost: 0.2523, reader_cost: 0.11973, ips: 15.8538 samples/sec | ETA 00:38:10
2022-07-21 15:56:45 [INFO]	[TRAIN] epoch: 3, iter: 10930/20000, loss: 0.4101, lr: 0.004909, batch_cost: 0.2524, reader_cost: 0.12279, ips: 15.8486 samples/sec | ETA 00:38:09
2022-07-21 15:56:48 [INFO]	[TRAIN] epoch: 3, iter: 10940/20000, loss: 0.3804, lr: 0.004904, batch_cost: 0.2538, reader_cost: 0.12336, ips: 15.7621 samples/sec | ETA 00:38:19
2022-07-21 15:56:50 [INFO]	[TRAIN] epoch: 3, iter: 10950/20000, loss: 0.3426, lr: 0.004899, batch_cost: 0.2415, reader_cost: 0.11585, ips: 16.5661 samples/sec | ETA 00:36:25
2022-07-21 15:56:52 [INFO]	[TRAIN] epoch: 3, iter: 10960/20000, loss: 0.3328, lr: 0.004894, batch_cost: 0.2280, reader_cost: 0.10674, ips: 17.5467 samples/sec | ETA 00:34:20
2022-07-21 15:56:55 [INFO]	[TRAIN] epoch: 3, iter: 10970/20000, loss: 0.4474, lr: 0.004889, batch_cost: 0.2472, reader_cost: 0.11906, ips: 16.1843 samples/sec | ETA 00:37:11
2022-07-21 15:56:57 [INFO]	[TRAIN] epoch: 3, iter: 10980/20000, loss: 0.2909, lr: 0.004884, batch_cost: 0.2302, reader_cost: 0.10617, ips: 17.3749 samples/sec | ETA 00:34:36
2022-07-21 15:57:00 [INFO]	[TRAIN] epoch: 3, iter: 10990/20000, loss: 0.4413, lr: 0.004879, batch_cost: 0.3208, reader_cost: 0.15816, ips: 12.4702 samples/sec | ETA 00:48:10
2022-07-21 15:57:03 [INFO]	[TRAIN] epoch: 3, iter: 11000/20000, loss: 0.3464, lr: 0.004875, batch_cost: 0.2964, reader_cost: 0.13717, ips: 13.4961 samples/sec | ETA 00:44:27
2022-07-21 15:57:06 [INFO]	[TRAIN] epoch: 3, iter: 11010/20000, loss: 0.3409, lr: 0.004870, batch_cost: 0.2245, reader_cost: 0.10200, ips: 17.8149 samples/sec | ETA 00:33:38
2022-07-21 15:57:08 [INFO]	[TRAIN] epoch: 3, iter: 11020/20000, loss: 0.3205, lr: 0.004865, batch_cost: 0.2352, reader_cost: 0.11222, ips: 17.0040 samples/sec | ETA 00:35:12
2022-07-21 15:57:10 [INFO]	[TRAIN] epoch: 3, iter: 11030/20000, loss: 0.3511, lr: 0.004860, batch_cost: 0.2361, reader_cost: 0.11405, ips: 16.9453 samples/sec | ETA 00:35:17
2022-07-21 15:57:13 [INFO]	[TRAIN] epoch: 3, iter: 11040/20000, loss: 0.3380, lr: 0.004855, batch_cost: 0.2315, reader_cost: 0.10828, ips: 17.2815 samples/sec | ETA 00:34:33
2022-07-21 15:57:15 [INFO]	[TRAIN] epoch: 3, iter: 11050/20000, loss: 0.4317, lr: 0.004850, batch_cost: 0.2415, reader_cost: 0.11677, ips: 16.5648 samples/sec | ETA 00:36:01
2022-07-21 15:57:17 [INFO]	[TRAIN] epoch: 3, iter: 11060/20000, loss: 0.3769, lr: 0.004845, batch_cost: 0.2237, reader_cost: 0.10177, ips: 17.8787 samples/sec | ETA 00:33:20
2022-07-21 15:57:20 [INFO]	[TRAIN] epoch: 3, iter: 11070/20000, loss: 0.4009, lr: 0.004840, batch_cost: 0.2288, reader_cost: 0.09976, ips: 17.4833 samples/sec | ETA 00:34:03
2022-07-21 15:57:24 [INFO]	[TRAIN] epoch: 3, iter: 11080/20000, loss: 0.3407, lr: 0.004836, batch_cost: 0.3922, reader_cost: 0.20090, ips: 10.1988 samples/sec | ETA 00:58:18
2022-07-21 15:57:26 [INFO]	[TRAIN] epoch: 3, iter: 11090/20000, loss: 0.3852, lr: 0.004831, batch_cost: 0.2595, reader_cost: 0.11877, ips: 15.4113 samples/sec | ETA 00:38:32
2022-07-21 15:57:28 [INFO]	[TRAIN] epoch: 3, iter: 11100/20000, loss: 0.2889, lr: 0.004826, batch_cost: 0.2349, reader_cost: 0.10722, ips: 17.0307 samples/sec | ETA 00:34:50
2022-07-21 15:57:31 [INFO]	[TRAIN] epoch: 3, iter: 11110/20000, loss: 0.2835, lr: 0.004821, batch_cost: 0.2804, reader_cost: 0.13581, ips: 14.2651 samples/sec | ETA 00:41:32
2022-07-21 15:57:34 [INFO]	[TRAIN] epoch: 3, iter: 11120/20000, loss: 0.4029, lr: 0.004816, batch_cost: 0.2270, reader_cost: 0.10339, ips: 17.6192 samples/sec | ETA 00:33:35
2022-07-21 15:57:36 [INFO]	[TRAIN] epoch: 3, iter: 11130/20000, loss: 0.3379, lr: 0.004811, batch_cost: 0.2266, reader_cost: 0.10365, ips: 17.6507 samples/sec | ETA 00:33:30
2022-07-21 15:57:38 [INFO]	[TRAIN] epoch: 3, iter: 11140/20000, loss: 0.3615, lr: 0.004806, batch_cost: 0.2238, reader_cost: 0.10295, ips: 17.8735 samples/sec | ETA 00:33:02
2022-07-21 15:57:40 [INFO]	[TRAIN] epoch: 3, iter: 11150/20000, loss: 0.3973, lr: 0.004801, batch_cost: 0.2343, reader_cost: 0.10983, ips: 17.0699 samples/sec | ETA 00:34:33
2022-07-21 15:57:43 [INFO]	[TRAIN] epoch: 3, iter: 11160/20000, loss: 0.3186, lr: 0.004796, batch_cost: 0.2280, reader_cost: 0.10207, ips: 17.5435 samples/sec | ETA 00:33:35
2022-07-21 15:57:45 [INFO]	[TRAIN] epoch: 3, iter: 11170/20000, loss: 0.3561, lr: 0.004792, batch_cost: 0.2598, reader_cost: 0.13363, ips: 15.3943 samples/sec | ETA 00:38:14
2022-07-21 15:57:48 [INFO]	[TRAIN] epoch: 3, iter: 11180/20000, loss: 0.3956, lr: 0.004787, batch_cost: 0.2450, reader_cost: 0.11123, ips: 16.3279 samples/sec | ETA 00:36:00
2022-07-21 15:57:50 [INFO]	[TRAIN] epoch: 3, iter: 11190/20000, loss: 0.3258, lr: 0.004782, batch_cost: 0.2264, reader_cost: 0.09722, ips: 17.6708 samples/sec | ETA 00:33:14
2022-07-21 15:57:52 [INFO]	[TRAIN] epoch: 3, iter: 11200/20000, loss: 0.3449, lr: 0.004777, batch_cost: 0.2251, reader_cost: 0.10289, ips: 17.7662 samples/sec | ETA 00:33:01
2022-07-21 15:57:55 [INFO]	[TRAIN] epoch: 3, iter: 11210/20000, loss: 0.3543, lr: 0.004772, batch_cost: 0.2351, reader_cost: 0.11186, ips: 17.0167 samples/sec | ETA 00:34:26
2022-07-21 15:57:57 [INFO]	[TRAIN] epoch: 3, iter: 11220/20000, loss: 0.3273, lr: 0.004767, batch_cost: 0.2311, reader_cost: 0.10625, ips: 17.3112 samples/sec | ETA 00:33:48
2022-07-21 15:57:59 [INFO]	[TRAIN] epoch: 3, iter: 11230/20000, loss: 0.2933, lr: 0.004762, batch_cost: 0.2267, reader_cost: 0.10459, ips: 17.6409 samples/sec | ETA 00:33:08
2022-07-21 15:58:02 [INFO]	[TRAIN] epoch: 3, iter: 11240/20000, loss: 0.3187, lr: 0.004757, batch_cost: 0.2527, reader_cost: 0.12374, ips: 15.8276 samples/sec | ETA 00:36:53
2022-07-21 15:58:05 [INFO]	[TRAIN] epoch: 3, iter: 11250/20000, loss: 0.4843, lr: 0.004753, batch_cost: 0.3389, reader_cost: 0.16831, ips: 11.8028 samples/sec | ETA 00:49:25
2022-07-21 15:58:08 [INFO]	[TRAIN] epoch: 3, iter: 11260/20000, loss: 0.3408, lr: 0.004748, batch_cost: 0.2812, reader_cost: 0.13217, ips: 14.2244 samples/sec | ETA 00:40:57
2022-07-21 15:58:10 [INFO]	[TRAIN] epoch: 3, iter: 11270/20000, loss: 0.3243, lr: 0.004743, batch_cost: 0.2385, reader_cost: 0.10677, ips: 16.7682 samples/sec | ETA 00:34:42
2022-07-21 15:58:13 [INFO]	[TRAIN] epoch: 3, iter: 11280/20000, loss: 0.3574, lr: 0.004738, batch_cost: 0.2384, reader_cost: 0.11048, ips: 16.7754 samples/sec | ETA 00:34:39
2022-07-21 15:58:15 [INFO]	[TRAIN] epoch: 3, iter: 11290/20000, loss: 0.3157, lr: 0.004733, batch_cost: 0.2441, reader_cost: 0.11981, ips: 16.3860 samples/sec | ETA 00:35:26
2022-07-21 15:58:17 [INFO]	[TRAIN] epoch: 3, iter: 11300/20000, loss: 0.5059, lr: 0.004728, batch_cost: 0.2214, reader_cost: 0.09555, ips: 18.0666 samples/sec | ETA 00:32:06
2022-07-21 15:58:20 [INFO]	[TRAIN] epoch: 3, iter: 11310/20000, loss: 0.4382, lr: 0.004723, batch_cost: 0.2311, reader_cost: 0.10699, ips: 17.3106 samples/sec | ETA 00:33:28
2022-07-21 15:58:22 [INFO]	[TRAIN] epoch: 3, iter: 11320/20000, loss: 0.2926, lr: 0.004718, batch_cost: 0.2631, reader_cost: 0.13541, ips: 15.2007 samples/sec | ETA 00:38:04
2022-07-21 15:58:25 [INFO]	[TRAIN] epoch: 3, iter: 11330/20000, loss: 0.3461, lr: 0.004713, batch_cost: 0.2410, reader_cost: 0.11314, ips: 16.5994 samples/sec | ETA 00:34:49
2022-07-21 15:58:27 [INFO]	[TRAIN] epoch: 3, iter: 11340/20000, loss: 0.3426, lr: 0.004709, batch_cost: 0.2300, reader_cost: 0.09778, ips: 17.3926 samples/sec | ETA 00:33:11
2022-07-21 15:58:31 [INFO]	[TRAIN] epoch: 3, iter: 11350/20000, loss: 0.3127, lr: 0.004704, batch_cost: 0.4102, reader_cost: 0.21323, ips: 9.7512 samples/sec | ETA 00:59:08
2022-07-21 15:58:34 [INFO]	[TRAIN] epoch: 3, iter: 11360/20000, loss: 0.3642, lr: 0.004699, batch_cost: 0.2606, reader_cost: 0.12482, ips: 15.3472 samples/sec | ETA 00:37:31
2022-07-21 15:58:36 [INFO]	[TRAIN] epoch: 3, iter: 11370/20000, loss: 0.3804, lr: 0.004694, batch_cost: 0.2550, reader_cost: 0.11853, ips: 15.6877 samples/sec | ETA 00:36:40
2022-07-21 15:58:39 [INFO]	[TRAIN] epoch: 3, iter: 11380/20000, loss: 0.4474, lr: 0.004689, batch_cost: 0.2274, reader_cost: 0.10640, ips: 17.5882 samples/sec | ETA 00:32:40
2022-07-21 15:58:41 [INFO]	[TRAIN] epoch: 3, iter: 11390/20000, loss: 0.3405, lr: 0.004684, batch_cost: 0.2463, reader_cost: 0.11880, ips: 16.2429 samples/sec | ETA 00:35:20
2022-07-21 15:58:43 [INFO]	[TRAIN] epoch: 3, iter: 11400/20000, loss: 0.4528, lr: 0.004679, batch_cost: 0.2476, reader_cost: 0.11981, ips: 16.1581 samples/sec | ETA 00:35:28
2022-07-21 15:58:46 [INFO]	[TRAIN] epoch: 3, iter: 11410/20000, loss: 0.4124, lr: 0.004674, batch_cost: 0.2314, reader_cost: 0.10401, ips: 17.2842 samples/sec | ETA 00:33:07
2022-07-21 15:58:48 [INFO]	[TRAIN] epoch: 3, iter: 11420/20000, loss: 0.3971, lr: 0.004669, batch_cost: 0.2360, reader_cost: 0.11122, ips: 16.9475 samples/sec | ETA 00:33:45
2022-07-21 15:58:50 [INFO]	[TRAIN] epoch: 3, iter: 11430/20000, loss: 0.3674, lr: 0.004664, batch_cost: 0.2294, reader_cost: 0.10357, ips: 17.4379 samples/sec | ETA 00:32:45
2022-07-21 15:58:53 [INFO]	[TRAIN] epoch: 3, iter: 11440/20000, loss: 0.3899, lr: 0.004660, batch_cost: 0.2509, reader_cost: 0.12806, ips: 15.9408 samples/sec | ETA 00:35:47
2022-07-21 15:58:55 [INFO]	[TRAIN] epoch: 3, iter: 11450/20000, loss: 0.4188, lr: 0.004655, batch_cost: 0.2342, reader_cost: 0.10666, ips: 17.0805 samples/sec | ETA 00:33:22
2022-07-21 15:58:58 [INFO]	[TRAIN] epoch: 3, iter: 11460/20000, loss: 0.3202, lr: 0.004650, batch_cost: 0.2343, reader_cost: 0.10748, ips: 17.0691 samples/sec | ETA 00:33:21
2022-07-21 15:59:00 [INFO]	[TRAIN] epoch: 3, iter: 11470/20000, loss: 0.3427, lr: 0.004645, batch_cost: 0.2267, reader_cost: 0.09805, ips: 17.6457 samples/sec | ETA 00:32:13
2022-07-21 15:59:02 [INFO]	[TRAIN] epoch: 3, iter: 11480/20000, loss: 0.4160, lr: 0.004640, batch_cost: 0.2412, reader_cost: 0.11435, ips: 16.5817 samples/sec | ETA 00:34:15
2022-07-21 15:59:05 [INFO]	[TRAIN] epoch: 3, iter: 11490/20000, loss: 0.4208, lr: 0.004635, batch_cost: 0.2252, reader_cost: 0.09405, ips: 17.7592 samples/sec | ETA 00:31:56
2022-07-21 15:59:07 [INFO]	[TRAIN] epoch: 3, iter: 11500/20000, loss: 0.4001, lr: 0.004630, batch_cost: 0.2853, reader_cost: 0.12004, ips: 14.0191 samples/sec | ETA 00:40:25
2022-07-21 15:59:10 [INFO]	[TRAIN] epoch: 3, iter: 11510/20000, loss: 0.2887, lr: 0.004625, batch_cost: 0.2841, reader_cost: 0.13407, ips: 14.0791 samples/sec | ETA 00:40:12
2022-07-21 15:59:13 [INFO]	[TRAIN] epoch: 3, iter: 11520/20000, loss: 0.4400, lr: 0.004620, batch_cost: 0.2611, reader_cost: 0.13305, ips: 15.3206 samples/sec | ETA 00:36:54
2022-07-21 15:59:15 [INFO]	[TRAIN] epoch: 3, iter: 11530/20000, loss: 0.2940, lr: 0.004615, batch_cost: 0.2542, reader_cost: 0.12828, ips: 15.7326 samples/sec | ETA 00:35:53
2022-07-21 15:59:18 [INFO]	[TRAIN] epoch: 3, iter: 11540/20000, loss: 0.6789, lr: 0.004611, batch_cost: 0.2305, reader_cost: 0.10695, ips: 17.3571 samples/sec | ETA 00:32:29
2022-07-21 15:59:20 [INFO]	[TRAIN] epoch: 3, iter: 11550/20000, loss: 0.4579, lr: 0.004606, batch_cost: 0.2321, reader_cost: 0.10805, ips: 17.2348 samples/sec | ETA 00:32:41
2022-07-21 15:59:23 [INFO]	[TRAIN] epoch: 3, iter: 11560/20000, loss: 0.2699, lr: 0.004601, batch_cost: 0.2460, reader_cost: 0.12131, ips: 16.2596 samples/sec | ETA 00:34:36
2022-07-21 15:59:25 [INFO]	[TRAIN] epoch: 3, iter: 11570/20000, loss: 0.3821, lr: 0.004596, batch_cost: 0.2404, reader_cost: 0.11531, ips: 16.6383 samples/sec | ETA 00:33:46
2022-07-21 15:59:27 [INFO]	[TRAIN] epoch: 3, iter: 11580/20000, loss: 0.3816, lr: 0.004591, batch_cost: 0.2312, reader_cost: 0.10315, ips: 17.3021 samples/sec | ETA 00:32:26
2022-07-21 15:59:29 [INFO]	[TRAIN] epoch: 3, iter: 11590/20000, loss: 0.2769, lr: 0.004586, batch_cost: 0.2255, reader_cost: 0.09796, ips: 17.7397 samples/sec | ETA 00:31:36
2022-07-21 15:59:32 [INFO]	[TRAIN] epoch: 3, iter: 11600/20000, loss: 0.4626, lr: 0.004581, batch_cost: 0.2255, reader_cost: 0.10080, ips: 17.7393 samples/sec | ETA 00:31:34
2022-07-21 15:59:34 [INFO]	[TRAIN] epoch: 3, iter: 11610/20000, loss: 0.3068, lr: 0.004576, batch_cost: 0.2444, reader_cost: 0.11191, ips: 16.3677 samples/sec | ETA 00:34:10
2022-07-21 15:59:38 [INFO]	[TRAIN] epoch: 3, iter: 11620/20000, loss: 0.4552, lr: 0.004571, batch_cost: 0.3900, reader_cost: 0.19875, ips: 10.2555 samples/sec | ETA 00:54:28
2022-07-21 15:59:41 [INFO]	[TRAIN] epoch: 3, iter: 11630/20000, loss: 0.3540, lr: 0.004566, batch_cost: 0.3137, reader_cost: 0.14616, ips: 12.7498 samples/sec | ETA 00:43:45
2022-07-21 15:59:44 [INFO]	[TRAIN] epoch: 3, iter: 11640/20000, loss: 0.4021, lr: 0.004561, batch_cost: 0.2303, reader_cost: 0.10378, ips: 17.3720 samples/sec | ETA 00:32:04
2022-07-21 15:59:46 [INFO]	[TRAIN] epoch: 3, iter: 11650/20000, loss: 0.3727, lr: 0.004557, batch_cost: 0.2389, reader_cost: 0.10913, ips: 16.7450 samples/sec | ETA 00:33:14
2022-07-21 15:59:48 [INFO]	[TRAIN] epoch: 3, iter: 11660/20000, loss: 0.3768, lr: 0.004552, batch_cost: 0.2454, reader_cost: 0.11898, ips: 16.2974 samples/sec | ETA 00:34:06
2022-07-21 15:59:51 [INFO]	[TRAIN] epoch: 3, iter: 11670/20000, loss: 0.2816, lr: 0.004547, batch_cost: 0.2287, reader_cost: 0.10642, ips: 17.4935 samples/sec | ETA 00:31:44
2022-07-21 15:59:53 [INFO]	[TRAIN] epoch: 3, iter: 11680/20000, loss: 0.3239, lr: 0.004542, batch_cost: 0.2245, reader_cost: 0.10052, ips: 17.8195 samples/sec | ETA 00:31:07
2022-07-21 15:59:55 [INFO]	[TRAIN] epoch: 3, iter: 11690/20000, loss: 0.2929, lr: 0.004537, batch_cost: 0.2342, reader_cost: 0.10558, ips: 17.0798 samples/sec | ETA 00:32:26
2022-07-21 15:59:58 [INFO]	[TRAIN] epoch: 3, iter: 11700/20000, loss: 0.3777, lr: 0.004532, batch_cost: 0.2442, reader_cost: 0.12139, ips: 16.3816 samples/sec | ETA 00:33:46
2022-07-21 16:00:00 [INFO]	[TRAIN] epoch: 3, iter: 11710/20000, loss: 0.4263, lr: 0.004527, batch_cost: 0.2396, reader_cost: 0.11834, ips: 16.6958 samples/sec | ETA 00:33:06
2022-07-21 16:00:03 [INFO]	[TRAIN] epoch: 3, iter: 11720/20000, loss: 0.2970, lr: 0.004522, batch_cost: 0.2516, reader_cost: 0.13054, ips: 15.8970 samples/sec | ETA 00:34:43
2022-07-21 16:00:05 [INFO]	[TRAIN] epoch: 3, iter: 11730/20000, loss: 0.4070, lr: 0.004517, batch_cost: 0.2510, reader_cost: 0.12928, ips: 15.9333 samples/sec | ETA 00:34:36
2022-07-21 16:00:08 [INFO]	[TRAIN] epoch: 3, iter: 11740/20000, loss: 0.2966, lr: 0.004512, batch_cost: 0.2467, reader_cost: 0.12470, ips: 16.2139 samples/sec | ETA 00:33:57
2022-07-21 16:00:11 [INFO]	[TRAIN] epoch: 3, iter: 11750/20000, loss: 0.5062, lr: 0.004507, batch_cost: 0.3104, reader_cost: 0.13841, ips: 12.8854 samples/sec | ETA 00:42:41
2022-07-21 16:00:13 [INFO]	[TRAIN] epoch: 3, iter: 11760/20000, loss: 0.3227, lr: 0.004503, batch_cost: 0.2585, reader_cost: 0.12437, ips: 15.4739 samples/sec | ETA 00:35:30
2022-07-21 16:00:16 [INFO]	[TRAIN] epoch: 3, iter: 11770/20000, loss: 0.2404, lr: 0.004498, batch_cost: 0.2373, reader_cost: 0.11060, ips: 16.8583 samples/sec | ETA 00:32:32
2022-07-21 16:00:18 [INFO]	[TRAIN] epoch: 3, iter: 11780/20000, loss: 0.2977, lr: 0.004493, batch_cost: 0.2398, reader_cost: 0.11536, ips: 16.6772 samples/sec | ETA 00:32:51
2022-07-21 16:00:21 [INFO]	[TRAIN] epoch: 3, iter: 11790/20000, loss: 0.3985, lr: 0.004488, batch_cost: 0.2569, reader_cost: 0.12939, ips: 15.5705 samples/sec | ETA 00:35:09
2022-07-21 16:00:23 [INFO]	[TRAIN] epoch: 3, iter: 11800/20000, loss: 0.2839, lr: 0.004483, batch_cost: 0.2504, reader_cost: 0.12570, ips: 15.9723 samples/sec | ETA 00:34:13
2022-07-21 16:00:26 [INFO]	[TRAIN] epoch: 3, iter: 11810/20000, loss: 0.3727, lr: 0.004478, batch_cost: 0.2437, reader_cost: 0.11802, ips: 16.4142 samples/sec | ETA 00:33:15
2022-07-21 16:00:28 [INFO]	[TRAIN] epoch: 3, iter: 11820/20000, loss: 0.3121, lr: 0.004473, batch_cost: 0.2370, reader_cost: 0.11204, ips: 16.8756 samples/sec | ETA 00:32:18
2022-07-21 16:00:30 [INFO]	[TRAIN] epoch: 3, iter: 11830/20000, loss: 0.3445, lr: 0.004468, batch_cost: 0.2374, reader_cost: 0.11001, ips: 16.8491 samples/sec | ETA 00:32:19
2022-07-21 16:00:33 [INFO]	[TRAIN] epoch: 3, iter: 11840/20000, loss: 0.4331, lr: 0.004463, batch_cost: 0.2298, reader_cost: 0.10818, ips: 17.4043 samples/sec | ETA 00:31:15
2022-07-21 16:00:35 [INFO]	[TRAIN] epoch: 3, iter: 11850/20000, loss: 0.4077, lr: 0.004458, batch_cost: 0.2210, reader_cost: 0.09929, ips: 18.0969 samples/sec | ETA 00:30:01
2022-07-21 16:00:37 [INFO]	[TRAIN] epoch: 3, iter: 11860/20000, loss: 0.3754, lr: 0.004453, batch_cost: 0.2210, reader_cost: 0.09842, ips: 18.0978 samples/sec | ETA 00:29:59
2022-07-21 16:00:39 [INFO]	[TRAIN] epoch: 3, iter: 11870/20000, loss: 0.4111, lr: 0.004448, batch_cost: 0.2288, reader_cost: 0.10703, ips: 17.4855 samples/sec | ETA 00:30:59
2022-07-21 16:00:42 [INFO]	[TRAIN] epoch: 3, iter: 11880/20000, loss: 0.3511, lr: 0.004443, batch_cost: 0.2682, reader_cost: 0.12566, ips: 14.9146 samples/sec | ETA 00:36:17
2022-07-21 16:00:47 [INFO]	[TRAIN] epoch: 3, iter: 11890/20000, loss: 0.2718, lr: 0.004439, batch_cost: 0.4704, reader_cost: 0.21734, ips: 8.5025 samples/sec | ETA 01:03:35
2022-07-21 16:00:49 [INFO]	[TRAIN] epoch: 3, iter: 11900/20000, loss: 0.3884, lr: 0.004434, batch_cost: 0.2747, reader_cost: 0.13509, ips: 14.5592 s
Logo

学大模型,用大模型上飞桨星河社区!每天8点V100G算力免费领!免费领取ERNIE 4.0 100w Token >>>

更多推荐