pytorch训练好的模型在加载和保存过程中的问题

在gpu上训练完成，在cpu上加载

torch.save(model.state_dict(), PATH)# 在gpu上训练后保存

# 在cpu的模型上加载使用
model.load_state_dict(torch.load(PATH, map_location='cpu'))
1
2
3
4

在cpu上训练完成，在gpu上加载

torch.save(model.state_dict(), PATH)# 在gpu上训练后保存

# 在cpu的模型上加载使用
model.load_state_dict(torch.load(PATH, map_location='cuda:0'))
1
2
3
4

在使用中需要注意的加载内容

当数据放入GPU，需要训练的模型也要放入GPU

'''
data_loader:pytorch中加载数据
'''
 for i, sample in enumerate(data_loader):  # 对数据进行按批次遍历
     image, target = sample  #  每一批次加载返回值
     if CUDA:
         image = image.cuda()   # 输入输出传入gpu
         target = target.cuda()
     # print(target.size)
     optimizer.zero_grad()     # 优化函数
     output = mymodel(image)

mymodel.to(torch.device("cuda"))
1
2
3
4
5
6
7
8
9
10
11
12
13

在这里插入图片描述

多个gpu训练时的加载

参考：https://blog.csdn.net/weixin_43794311/article/details/120940090

import torch.nn as nn
mymodel = nn.DataParallel(mymodel)
1
2

pytorch中的nn模块使用nn.DataParallel将模型加载到多个GPU，需要注意，这种加载方式保存的权重参数会比不使用nn.DataParallel加载模型保存的权重参数的关键字前多一个"module."。是否使用nn.DataParallel加载模型，会导致下次再加载模型的时候可能会出现下图的问题，
在这里插入图片描述
当权重参数前面多一个“module."时，最简单的方式就是使用nn.DataParallel对模型加载，

相关阅读:
数据库配置mysql5.7
【数据结构基础】之栈与队列介绍，生动形象，通俗易懂，算法入门必看
Pytorch学习：卷积神经网络—nn.Conv2d、nn.MaxPool2d、nn.ReLU、nn.Linear和nn.Dropout
本地缓存 guava
PCIe系列专题之二：2.2 TLP事务处理方式解析
R语言plotly可视化：plotly可视化箱图、并添加抖动数据点jitter（Adding Jittered Points）
Fastnet，三步完成高性能的网络开发
7.1-安全保护等级 7.2-安全防护体系 7.3-数据安全策略 7.4-安全防护策略
Word格式处理控件Aspose.Words for .NET教程——使用DOM插入字段
【C++ Primer Plus学习记录】指针——小结

原文地址：https://blog.csdn.net/weixin_43794311/article/details/125517326