转载自AI Studio
项目链接https://aistudio.baidu.com/aistudio/projectdetail/2772561

赛题介绍

人脸识别是基于人的面部特征信息进行身份识别的一种生物识别技术,金融和安防是目前人脸识别应用最广泛的两个领域。人脸关键点是人脸识别中的关键技术。人脸关键点检测需要识别出人脸的指定位置坐标,例如眉毛、眼睛、鼻子、嘴巴和脸部轮廓等位置坐标等。

赛事任务

给定人脸图像,找到4个人脸关键点,赛题任务可以视为一个关键点检测问题。

  • 训练集:5千张人脸图像,并且给定了具体的人脸关键点标注。

  • 测试集:约2千张人脸图像,需要选手识别出具体的关键点位置。

数据说明

赛题数据由训练集和测试集组成,train.csv为训练集标注数据,train.npy和test.npy为训练集图片和测试集图片,可以使用numpy.load进行读取。

train.csv的信息为左眼坐标、右眼坐标、鼻子坐标和嘴巴坐标,总共8个点。

left_eye_center_x,left_eye_center_y,right_eye_center_x,right_eye_center_y,nose_tip_x,nose_tip_y,mouth_center_bottom_lip_x,mouth_center_bottom_lip_y
66.3423640449,38.5236134831,28.9308404494,35.5777725843,49.256844943800004,68.2759550562,47.783946067399995,85.3615820225
68.9126037736,31.409116981100002,29.652226415100003,33.0280754717,51.913358490600004,48.408452830200005,50.6988679245,79.5740377358
68.7089943925,40.371149158899996,27.1308201869,40.9406803738,44.5025226168,69.9884859813,45.9264269159,86.2210093458

评审规则

本次竞赛的评价标准回归MAE进行评价,数值越小性能更优,最高分为0。评估代码参考:

from sklearn.metrics import mean_absolute_error
y_true = [3, -0.5, 2, 7]
y_pred = [2.5, 0.0, 2, 8]
mean_absolute_error(y_true, y_pred)

步骤1:数据集解压

!echo y | unzip -O CP936 /home/aistudio/data/data117050/人脸关键点检测挑战赛_数据集.zip
!mv 人脸关键点检测挑战赛_数据集/* ./
!echo y | unzip test.npy.zip
!echo y | unzip train.npy.zip
Archive:  /home/aistudio/data/data117050/人脸关键点检测挑战赛_数据集.zip
  inflating: 人脸关键点检测挑战赛_数据集/sample_submit.csv  
  inflating: 人脸关键点检测挑战赛_数据集/test.npy.zip  
  inflating: 人脸关键点检测挑战赛_数据集/train.csv  
  inflating: 人脸关键点检测挑战赛_数据集/train.npy.zip  
Archive:  test.npy.zip
replace test.npy? [y]es, [n]o, [A]ll, [N]one, [r]ename:   inflating: test.npy                
Archive:  train.npy.zip
replace train.npy? [y]es, [n]o, [A]ll, [N]one, [r]ename:   inflating: train.npy               

步骤2:数据集读取

import pandas as pd
import numpy as np
  • train.csv:存储的是八个关键点的坐标。
  • train.npy:训练集图像
  • test.npy:测试集图像
# 读取标注
train_df = pd.read_csv('train.csv')
train_df = train_df.fillna(48)
train_df.head()
left_eye_center_xleft_eye_center_yright_eye_center_xright_eye_center_ynose_tip_xnose_tip_ymouth_center_bottom_lip_xmouth_center_bottom_lip_y
066.34236438.52361328.93084035.57777349.25684568.27595547.78394685.361582
168.91260431.40911729.65222633.02807551.91335848.40845350.69886879.574038
268.70899440.37114927.13082040.94068044.50252369.98848645.92642786.221009
365.33417635.47187829.36646137.76768450.41137364.93476750.02878074.883241
468.63485729.99948631.09457129.61642950.24742951.45085747.94857184.394286
# 读取数据集
train_img = np.load('train.npy')
test_img = np.load('test.npy')

train_img = np.transpose(train_img, [2, 0, 1])
train_img = train_img.reshape(-1, 1, 96, 96)

test_img = np.transpose(test_img, [2, 0, 1])
test_img = test_img.reshape(-1, 1, 96, 96)

print(train_img.shape, test_img.shape)
(5000, 1, 96, 96) (2049, 1, 96, 96)

步骤3: 数据集可视化

%pylab inline
idx = 409
xy = train_df.iloc[idx].values.reshape(-1, 2)
plt.scatter(xy[:, 0], xy[:, 1], c='r')
plt.imshow(train_img[idx, 0, :, :], cmap='gray')

/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/__init__.py:107: DeprecationWarning: Using or importing the ABCs from 'collections' instead of from 'collections.abc' is deprecated, and in 3.8 it will stop working
  from collections import MutableMapping
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/rcsetup.py:20: DeprecationWarning: Using or importing the ABCs from 'collections' instead of from 'collections.abc' is deprecated, and in 3.8 it will stop working
  from collections import Iterable, Mapping
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/colors.py:53: DeprecationWarning: Using or importing the ABCs from 'collections' instead of from 'collections.abc' is deprecated, and in 3.8 it will stop working
  from collections import Sized


Populating the interactive namespace from numpy and matplotlib


/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/cbook/__init__.py:2349: DeprecationWarning: Using or importing the ABCs from 'collections' instead of from 'collections.abc' is deprecated, and in 3.8 it will stop working
  if isinstance(obj, collections.Iterator):
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/cbook/__init__.py:2366: DeprecationWarning: Using or importing the ABCs from 'collections' instead of from 'collections.abc' is deprecated, and in 3.8 it will stop working
  return list(data) if isinstance(data, collections.MappingView) else data





<matplotlib.image.AxesImage at 0x7f917f910250>



/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/image.py:425: DeprecationWarning: np.asscalar(a) is deprecated since NumPy v1.16, use a.item() instead
  a_min = np.asscalar(a_min.astype(scaled_dtype))
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/image.py:426: DeprecationWarning: np.asscalar(a) is deprecated since NumPy v1.16, use a.item() instead
  a_max = np.asscalar(a_max.astype(scaled_dtype))

[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-uJ4DY3QK-1646534169422)(output_9_5.png)]

idx = 4090
xy = train_df.iloc[idx].values.reshape(-1, 2)
plt.scatter(xy[:, 0], xy[:, 1], c='r')
plt.imshow(train_img[idx, 0, :, :], cmap='gray')
<matplotlib.image.AxesImage at 0x7f9158b1c550>



/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/image.py:425: DeprecationWarning: np.asscalar(a) is deprecated since NumPy v1.16, use a.item() instead
  a_min = np.asscalar(a_min.astype(scaled_dtype))
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/image.py:426: DeprecationWarning: np.asscalar(a) is deprecated since NumPy v1.16, use a.item() instead
  a_max = np.asscalar(a_max.astype(scaled_dtype))

[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-lxc83zze-1646534169422)(output_10_2.png)]

xy = 96 - train_df.mean(0).values.reshape(-1, 2)
plt.scatter(xy[:, 0], xy[:, 1], c='r')
<matplotlib.collections.PathCollection at 0x7f9158b07ed0>

[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-nKBrfJ19-1646534169423)(output_11_1.png)]

步骤4:构建模型和数据集

import paddle
paddle.__version__
'2.2.2'

全连接模型

from paddle.io import DataLoader, Dataset
from PIL import Image

# 自定义模型
class MyDataset(Dataset):
    def __init__(self, img, keypoint):
        super(MyDataset, self).__init__()
        self.img = img
        self.keypoint = keypoint
    
    def __getitem__(self, index):
        img = Image.fromarray(self.img[index, 0, :, :])
        return np.asarray(img).astype(np.float32)/255, self.keypoint[index] / 96.0

    def __len__(self):
        return len(self.keypoint)

# 训练集
train_dataset = MyDataset(
    train_img[:-500, :, :, :], 
    paddle.to_tensor(train_df.values[:-500].astype(np.float32))
)
train_loader = DataLoader(train_dataset, batch_size=64, shuffle=True)

# 验证集
val_dataset = MyDataset(
    train_img[-500:, :, :, :], 
    paddle.to_tensor(train_df.values[-500:].astype(np.float32))
)
val_loader = DataLoader(val_dataset, batch_size=64, shuffle=False)

# 测试集
test_dataset = MyDataset(
    test_img[:, :, :], 
    paddle.to_tensor(np.zeros((test_img.shape[2], 8)))
)
test_loader = DataLoader(test_dataset, batch_size=64, shuffle=False)
# 定义全连接模型
model = paddle.nn.Sequential(
    paddle.nn.Flatten(),
    paddle.nn.Linear(96*96,128),
    paddle.nn.LeakyReLU(),
    paddle.nn.Linear(128, 8)
)

paddle.summary(model, (64, 96, 96))
W0123 00:43:41.304462   119 device_context.cc:447] Please NOTE: device: 0, GPU Compute Capability: 7.0, Driver API Version: 10.1, Runtime API Version: 10.1
W0123 00:43:41.309953   119 device_context.cc:465] device: 0, cuDNN Version: 7.6.


---------------------------------------------------------------------------
 Layer (type)       Input Shape          Output Shape         Param #    
===========================================================================
   Flatten-1       [[64, 96, 96]]         [64, 9216]             0       
   Linear-1         [[64, 9216]]          [64, 128]          1,179,776   
  LeakyReLU-1       [[64, 128]]           [64, 128]              0       
   Linear-2         [[64, 128]]            [64, 8]             1,032     
===========================================================================
Total params: 1,180,808
Trainable params: 1,180,808
Non-trainable params: 0
---------------------------------------------------------------------------
Input size (MB): 2.25
Forward/backward pass size (MB): 4.63
Params size (MB): 4.50
Estimated Total Size (MB): 11.38
---------------------------------------------------------------------------






{'total_params': 1180808, 'trainable_params': 1180808}
# 损失函数和优化器
optimizer = paddle.optimizer.Adam(parameters=model.parameters(), learning_rate=0.0001)
criterion = paddle.nn.MSELoss()

from sklearn.metrics import mean_absolute_error
for epoch in range(0, 40):
    Train_Loss, Val_Loss = [], []
    Train_MAE, Val_MAE = [], []

    # 训练
    model.train()
    for i, (x, y) in enumerate(train_loader):
        pred = model(x)
        loss = criterion(pred, y)
        Train_Loss.append(loss.item())
        loss.backward()
        optimizer.step()
        optimizer.clear_grad()
        Train_MAE.append(mean_absolute_error(y.numpy(), pred.numpy()) * 96  / y.shape[0])
    
    # 验证
    model.eval()
    for i, (x, y) in enumerate(val_loader):
        pred = model(x)
        loss = criterion(pred, y)
        Val_Loss.append(loss.item())
        Val_MAE.append(mean_absolute_error(y.numpy(), pred.numpy()) * 96 / y.shape[0])
    
    if epoch % 1 == 0:
        print(f'\nEpoch: {epoch}')
        print(f'Loss {np.mean(Train_Loss):3.5f}/{np.mean(Val_Loss):3.5f}')
        print(f'MAE {np.mean(Train_MAE):3.5f}/{np.mean(Val_MAE):3.5f}')

Epoch: 0
Loss 0.05956/0.02340
MAE 0.25278/0.18601

Epoch: 1
Loss 0.02075/0.02269
MAE 0.17376/0.17984

Epoch: 2
Loss 0.01832/0.01881
MAE 0.16236/0.16371

Epoch: 3
Loss 0.01752/0.01729
MAE 0.15944/0.15727

Epoch: 4
Loss 0.01630/0.01783
MAE 0.15351/0.16075

Epoch: 5
Loss 0.01535/0.01593
MAE 0.14883/0.15059

Epoch: 6
Loss 0.01489/0.01655
MAE 0.14582/0.15519

Epoch: 7
Loss 0.01469/0.01596
MAE 0.14487/0.14971

Epoch: 8
Loss 0.01362/0.01582
MAE 0.13930/0.15087

Epoch: 9
Loss 0.01355/0.01506
MAE 0.13915/0.14637

Epoch: 10
Loss 0.01293/0.01490
MAE 0.13586/0.14514

Epoch: 11
Loss 0.01289/0.01367
MAE 0.13555/0.13847

Epoch: 12
Loss 0.01187/0.01372
MAE 0.12944/0.13950

Epoch: 13
Loss 0.01184/0.01281
MAE 0.12905/0.13358

Epoch: 14
Loss 0.01181/0.01534
MAE 0.12995/0.14891

Epoch: 15
Loss 0.01124/0.01334
MAE 0.12593/0.13727

Epoch: 16
Loss 0.01083/0.01371
MAE 0.12342/0.14003

Epoch: 17
Loss 0.01057/0.01181
MAE 0.12188/0.12769

Epoch: 18
Loss 0.01041/0.01207
MAE 0.12105/0.12884

Epoch: 19
Loss 0.01017/0.01149
MAE 0.11868/0.12613

Epoch: 20
Loss 0.00965/0.01348
MAE 0.11610/0.13499

Epoch: 21
Loss 0.00993/0.01133
MAE 0.11817/0.12543

Epoch: 22
Loss 0.00906/0.01080
MAE 0.11226/0.12200

Epoch: 23
Loss 0.00883/0.01117
MAE 0.11127/0.12394

Epoch: 24
Loss 0.00865/0.01064
MAE 0.10986/0.12086

Epoch: 25
Loss 0.00924/0.01023
MAE 0.11396/0.11844

Epoch: 26
Loss 0.00850/0.01001
MAE 0.10874/0.11812

Epoch: 27
Loss 0.00801/0.00998
MAE 0.10525/0.11665

Epoch: 28
Loss 0.00809/0.00978
MAE 0.10666/0.11558

Epoch: 29
Loss 0.00743/0.01073
MAE 0.10161/0.12184

Epoch: 30
Loss 0.00752/0.00916
MAE 0.10146/0.11186

Epoch: 31
Loss 0.00715/0.00982
MAE 0.09895/0.11673

Epoch: 32
Loss 0.00717/0.00907
MAE 0.09980/0.11068

Epoch: 33
Loss 0.00718/0.00967
MAE 0.09976/0.11560

Epoch: 34
Loss 0.00677/0.01463
MAE 0.09663/0.14721

Epoch: 35
Loss 0.00764/0.00852
MAE 0.10249/0.10766

Epoch: 36
Loss 0.00650/0.00916
MAE 0.09434/0.11061

Epoch: 37
Loss 0.00644/0.00840
MAE 0.09397/0.10676

Epoch: 38
Loss 0.00642/0.00852
MAE 0.09410/0.10684

Epoch: 39
Loss 0.00611/0.00798
MAE 0.09161/0.10284
# 预测函数
def make_predict(model, loader):
    model.eval()
    predict_list = []
    for i, (x, y) in enumerate(loader):
        pred = model(x)
        predict_list.append(pred.numpy())
    return np.vstack(predict_list)

test_pred = make_predict(model, test_loader) * 96
idx = 40
xy = test_pred[idx, :].reshape(-1, 2)
plt.scatter(xy[:, 0], xy[:, 1], c='r')
plt.imshow(test_img[idx, 0, :, :], cmap='gray')
<matplotlib.image.AxesImage at 0x7fd715545490>



/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/image.py:425: DeprecationWarning: np.asscalar(a) is deprecated since NumPy v1.16, use a.item() instead
  a_min = np.asscalar(a_min.astype(scaled_dtype))
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/image.py:426: DeprecationWarning: np.asscalar(a) is deprecated since NumPy v1.16, use a.item() instead
  a_max = np.asscalar(a_max.astype(scaled_dtype))

[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-foUPG4d8-1646534169423)(output_19_2.png)]

idx = 42
xy = test_pred[idx, :].reshape(-1, 2)
plt.scatter(xy[:, 0], xy[:, 1], c='r')
plt.imshow(test_img[idx, 0, :, :], cmap='gray')
<matplotlib.image.AxesImage at 0x7fd7144d9350>

[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-gwLq47KP-1646534169424)(output_20_1.png)]

CNN模型

from paddle.io import DataLoader, Dataset
from PIL import Image

class MyDataset(Dataset):
    def __init__(self, img, keypoint):
        super(MyDataset, self).__init__()
        self.img = img
        self.keypoint = keypoint
    
    def __getitem__(self, index):
        img = Image.fromarray(self.img[index, 0, :, :])
        return np.asarray(img).reshape(1, 96, 96).astype(np.float32)/255, self.keypoint[index] / 96.0

    def __len__(self):
        return len(self.keypoint)

train_dataset = MyDataset(
    train_img[:-500, :, :, :], 
    paddle.to_tensor(train_df.values[:-500].astype(np.float32))
)
train_loader = DataLoader(train_dataset, batch_size=64, shuffle=True)

val_dataset = MyDataset(
    train_img[-500:, :, :, :], 
    paddle.to_tensor(train_df.values[-500:].astype(np.float32))
)
val_loader = DataLoader(val_dataset, batch_size=64, shuffle=False)

test_dataset = MyDataset(
    test_img[:, :, :], 
    paddle.to_tensor(np.zeros((test_img.shape[2], 8)))
)
test_loader = DataLoader(test_dataset, batch_size=64, shuffle=False)
# 卷积模型
model = paddle.nn.Sequential(
    paddle.nn.Conv2D(1, 10, (5, 5)),
    paddle.nn.ReLU(),
    paddle.nn.MaxPool2D((2, 2)),

    paddle.nn.Conv2D(10, 20, (5, 5)),
    paddle.nn.ReLU(),
    paddle.nn.MaxPool2D((2, 2)),

    paddle.nn.Conv2D(20, 40, (5, 5)),
    paddle.nn.ReLU(),
    paddle.nn.MaxPool2D((2, 2)),

    paddle.nn.Flatten(),
    paddle.nn.Linear(2560, 8),
)

paddle.summary(model, (64, 1, 96, 96))
---------------------------------------------------------------------------
 Layer (type)       Input Shape          Output Shape         Param #    
===========================================================================
   Conv2D-4      [[64, 1, 96, 96]]     [64, 10, 92, 92]         260      
    ReLU-4       [[64, 10, 92, 92]]    [64, 10, 92, 92]          0       
  MaxPool2D-4    [[64, 10, 92, 92]]    [64, 10, 46, 46]          0       
   Conv2D-5      [[64, 10, 46, 46]]    [64, 20, 42, 42]        5,020     
    ReLU-5       [[64, 20, 42, 42]]    [64, 20, 42, 42]          0       
  MaxPool2D-5    [[64, 20, 42, 42]]    [64, 20, 21, 21]          0       
   Conv2D-6      [[64, 20, 21, 21]]    [64, 40, 17, 17]       20,040     
    ReLU-6       [[64, 40, 17, 17]]    [64, 40, 17, 17]          0       
  MaxPool2D-6    [[64, 40, 17, 17]]     [64, 40, 8, 8]           0       
   Flatten-3      [[64, 40, 8, 8]]        [64, 2560]             0       
   Linear-4         [[64, 2560]]           [64, 8]            20,488     
===========================================================================
Total params: 45,808
Trainable params: 45,808
Non-trainable params: 0
---------------------------------------------------------------------------
Input size (MB): 2.25
Forward/backward pass size (MB): 145.54
Params size (MB): 0.17
Estimated Total Size (MB): 147.97
---------------------------------------------------------------------------






{'total_params': 45808, 'trainable_params': 45808}
# 损失函数和优化器
optimizer = paddle.optimizer.Adam(parameters=model.parameters(), learning_rate=0.0001)
criterion = paddle.nn.MSELoss()

from sklearn.metrics import mean_absolute_error
for epoch in range(0, 40):
    Train_Loss, Val_Loss = [], []
    Train_MAE, Val_MAE = [], []
    
    # 训练
    model.train()
    for i, (x, y) in enumerate(train_loader):
        pred = model(x)

        loss = criterion(pred, y)
        Train_Loss.append(loss.item())
        loss.backward()
        optimizer.step()
        optimizer.clear_grad()
        Train_MAE.append(mean_absolute_error(y.numpy(), pred.numpy()) * 96  / y.shape[0])
    
    # 验证
    model.eval()
    for i, (x, y) in enumerate(val_loader):
        pred = model(x)
        loss = criterion(pred, y)
        Val_Loss.append(loss.item())
        Val_MAE.append(mean_absolute_error(y.numpy(), pred.numpy()) * 96 / y.shape[0])
    
    if epoch % 1 == 0:
        print(f'\nEpoch: {epoch}')
        print(f'Loss {np.mean(Train_Loss):3.5f}/{np.mean(Val_Loss):3.5f}')
        print(f'MAE {np.mean(Train_MAE):3.5f}/{np.mean(Val_MAE):3.5f}')

Epoch: 0
Loss 0.23343/0.03865
MAE 0.44735/0.23946

Epoch: 1
Loss 0.03499/0.03301
MAE 0.22689/0.22072

Epoch: 2
Loss 0.03006/0.02846
MAE 0.20913/0.20492

Epoch: 3
Loss 0.02614/0.02548
MAE 0.19541/0.19341

Epoch: 4
Loss 0.02270/0.02314
MAE 0.18112/0.18211

Epoch: 5
Loss 0.01965/0.01952
MAE 0.16927/0.16763

Epoch: 6
Loss 0.01704/0.01763
MAE 0.15715/0.15866

Epoch: 7
Loss 0.01492/0.01483
MAE 0.14711/0.14516

Epoch: 8
Loss 0.01260/0.01268
MAE 0.13498/0.13350

Epoch: 9
Loss 0.01034/0.00996
MAE 0.12187/0.11828

Epoch: 10
Loss 0.00855/0.00836
MAE 0.11041/0.10738

Epoch: 11
Loss 0.00751/0.00737
MAE 0.10320/0.10133

Epoch: 12
Loss 0.00644/0.00657
MAE 0.09478/0.09471

Epoch: 13
Loss 0.00592/0.00626
MAE 0.09048/0.09321

Epoch: 14
Loss 0.00556/0.00568
MAE 0.08704/0.08790

Epoch: 15
Loss 0.00518/0.00538
MAE 0.08444/0.08551

Epoch: 16
Loss 0.00491/0.00524
MAE 0.08204/0.08433

Epoch: 17
Loss 0.00474/0.00495
MAE 0.08087/0.08178

Epoch: 18
Loss 0.00450/0.00476
MAE 0.07885/0.08041

Epoch: 19
Loss 0.00431/0.00460
MAE 0.07685/0.07922

Epoch: 20
Loss 0.00421/0.00458
MAE 0.07596/0.07887

Epoch: 21
Loss 0.00393/0.00421
MAE 0.07302/0.07515

Epoch: 22
Loss 0.00387/0.00419
MAE 0.07282/0.07502

Epoch: 23
Loss 0.00373/0.00416
MAE 0.07131/0.07482

Epoch: 24
Loss 0.00354/0.00385
MAE 0.06945/0.07177

Epoch: 25
Loss 0.00347/0.00386
MAE 0.06882/0.07173

Epoch: 26
Loss 0.00340/0.00368
MAE 0.06781/0.06999

Epoch: 27
Loss 0.00323/0.00363
MAE 0.06601/0.06949

Epoch: 28
Loss 0.00320/0.00349
MAE 0.06580/0.06794

Epoch: 29
Loss 0.00307/0.00349
MAE 0.06427/0.06842

Epoch: 30
Loss 0.00300/0.00336
MAE 0.06357/0.06692

Epoch: 31
Loss 0.00291/0.00329
MAE 0.06240/0.06611

Epoch: 32
Loss 0.00287/0.00326
MAE 0.06206/0.06594

Epoch: 33
Loss 0.00280/0.00323
MAE 0.06119/0.06572

Epoch: 34
Loss 0.00276/0.00312
MAE 0.06076/0.06427

Epoch: 35
Loss 0.00268/0.00304
MAE 0.05994/0.06345

Epoch: 36
Loss 0.00262/0.00301
MAE 0.05915/0.06306

Epoch: 37
Loss 0.00256/0.00294
MAE 0.05834/0.06231

Epoch: 38
Loss 0.00256/0.00288
MAE 0.05833/0.06166

Epoch: 39
Loss 0.00246/0.00284
MAE 0.05717/0.06128
def make_predict(model, loader):
    model.eval()
    predict_list = []
    for i, (x, y) in enumerate(loader):
        pred = model(x)
        predict_list.append(pred.numpy())
    return np.vstack(predict_list)

test_pred = make_predict(model, test_loader) * 96
idx = 40
xy = test_pred[idx, :].reshape(-1, 2)
plt.scatter(xy[:, 0], xy[:, 1], c='r')
plt.imshow(test_img[idx, 0, :, :], cmap='gray')
<matplotlib.image.AxesImage at 0x7f883439f290>



/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/image.py:425: DeprecationWarning: np.asscalar(a) is deprecated since NumPy v1.16, use a.item() instead
  a_min = np.asscalar(a_min.astype(scaled_dtype))
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/image.py:426: DeprecationWarning: np.asscalar(a) is deprecated since NumPy v1.16, use a.item() instead
  a_max = np.asscalar(a_max.astype(scaled_dtype))

[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-EP0ZxNvy-1646534169424)(output_26_2.png)]

idx = 42
xy = test_pred[idx, :].reshape(-1, 2)
plt.scatter(xy[:, 0], xy[:, 1], c='r')
plt.imshow(test_img[idx, 0, :, :], cmap='gray')
<matplotlib.image.AxesImage at 0x7f8834329d10>



/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/image.py:425: DeprecationWarning: np.asscalar(a) is deprecated since NumPy v1.16, use a.item() instead
  a_min = np.asscalar(a_min.astype(scaled_dtype))
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/matplotlib/image.py:426: DeprecationWarning: np.asscalar(a) is deprecated since NumPy v1.16, use a.item() instead
  a_max = np.asscalar(a_max.astype(scaled_dtype))

[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-VG4WggrM-1646534169424)(output_27_2.png)]

总结与改进

  1. 本文搭建了全连接网络和卷积神经网络完成了的关键点预测模型。
  2. 由于全连接网络和卷积网络的输入尺寸不同,所以在数据处理过程存在区别。
  3. 可以考虑额外的数据扩增操作和预训练模型改进现有思路。

Logo

学大模型,用大模型上飞桨星河社区!每天8点V100G算力免费领!免费领取ERNIE 4.0 100w Token >>>

更多推荐