【PyTorch】Torchvision Models

文章目录

六、Torchvision Models

六、Torchvision Models

1、VGG

VGG参考文档：https://pytorch.org/vision/stable/models/vgg.html

以VGG16为例：

https://pytorch.org/vision/stable/models/generated/torchvision.models.vgg16.html#torchvision.models.vgg16

ImageNet数据集：

https://pytorch.org/vision/stable/generated/torchvision.datasets.ImageNet.html#torchvision.datasets.ImageNet

ImageNet描述：

https://image-net.org/challenges/LSVRC/index.php

train_data = torchvision.datasets.ImageNet("../data", split="train", transform=torchvision.transforms.ToTensor(),
                                           download=True)
1
2

报错：需要手动下载！！！（100多G，还是算了吧）

RuntimeError: The archive ILSVRC2012_devkit_t12.tar.gz is not present in the root directory or is corrupted. You need to download it externally and place it in ../data.

import torchvision

vgg16_false = torchvision.models.vgg16(pretrained=False)  # False 加载网络模型 不需要下载
vgg16_true = torchvision.models.vgg16(pretrained=True)

print(vgg16_true)
1
2
3
4
5
6

VGG(
  (features): Sequential(
    (0): Conv2d(3, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (1): ReLU(inplace=True)
    (2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (3): ReLU(inplace=True)
    (4): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
    (5): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (6): ReLU(inplace=True)
    (7): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (8): ReLU(inplace=True)
    (9): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
    (10): Conv2d(128, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (11): ReLU(inplace=True)
    (12): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (13): ReLU(inplace=True)
    (14): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (15): ReLU(inplace=True)
    (16): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
    (17): Conv2d(256, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (18): ReLU(inplace=True)
    (19): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (20): ReLU(inplace=True)
    (21): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (22): ReLU(inplace=True)
    (23): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
    (24): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (25): ReLU(inplace=True)
    (26): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (27): ReLU(inplace=True)
    (28): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (29): ReLU(inplace=True)
    (30): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
  )
  (avgpool): AdaptiveAvgPool2d(output_size=(7, 7))
  (classifier): Sequential(
    (0): Linear(in_features=25088, out_features=4096, bias=True)
    (1): ReLU(inplace=True)
    (2): Dropout(p=0.5, inplace=False)
    (3): Linear(in_features=4096, out_features=4096, bias=True)
    (4): ReLU(inplace=True)
    (5): Dropout(p=0.5, inplace=False)
    (6): Linear(in_features=4096, out_features=1000, bias=True)
  )
)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45

out_features=1000，输出为1000个类，如果想要输出10个类，应该如何？

1.1 add

（1）在VGG16中的features中添加add_linear：

import torchvision
from torch import nn

vgg16_true = torchvision.models.vgg16(pretrained=True)
print(vgg16_true)

vgg16_true.add_module('add_linear', nn.Linear(in_features=1000, out_features=10))
print(vgg16_true)
1
2
3
4
5
6
7
8

VGG(
  (features): Sequential(
    (0): Conv2d(3, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    ...
  )
  (avgpool): AdaptiveAvgPool2d(output_size=(7, 7))
  (classifier): Sequential(
    (0): Linear(in_features=25088, out_features=4096, bias=True)
    ...
    (6): Linear(in_features=4096, out_features=1000, bias=True)
  )
  (add_linear): Linear(in_features=1000, out_features=10, bias=True)
)
1
2
3
4
5
6
7
8
9
10
11
12
13

（2）在VGG16中的classifier中添加add_linear：

vgg16_true.classifier.add_module('add_linear', nn.Linear(in_features=1000, out_features=10))
1

VGG(
  (features): Sequential(
    ...
  )
  (avgpool): AdaptiveAvgPool2d(output_size=(7, 7))
  (classifier): Sequential(
    ...
    (6): Linear(in_features=4096, out_features=1000, bias=True)
    (add_linear): Linear(in_features=1000, out_features=10, bias=True)
  )
)
1
2
3
4
5
6
7
8
9
10
11

1.2 modify

直接将out_features=1000，修改为，输出100：

vgg16_true.classifier[6] = nn.Linear(in_features=1000, out_features=10)
1

VGG(
  (features): Sequential(
    ...
  )
  (avgpool): AdaptiveAvgPool2d(output_size=(7, 7))
  (classifier): Sequential(
   ...
    (6): Linear(in_features=1000, out_features=10, bias=True)
  )
)
1
2
3
4
5
6
7
8
9
10

2、Save and Load

2.1 模型结构 + 模型参数

vgg16 = torchvision.models.vgg16(pretrained=False)
torch.save(vgg16, "../model/vgg16_method1.pth")
1
2

vgg16 = torch.load("../model/vgg16_method1.pth")
print(vgg16)
1
2

VGG(
  (features): Sequential(
    ...
  )
  (avgpool): AdaptiveAvgPool2d(output_size=(7, 7))
  (classifier): Sequential(
    ...
  )
)
1
2
3
4
5
6
7
8
9

2.2 模型参数（官方推荐）

vgg16 = torchvision.models.vgg16(pretrained=False)
torch.save(vgg16.state_dict(), "../model/vgg16_method2.pth")
1
2

查看一下保存的字典：

vgg16 = torch.load("../model/vgg16_method2.pth")
print(vgg16)
1
2

OrderedDict([('features.0.weight', tensor([[[[-0.0108,  0.0403, -0.0032],
          [-0.0723,  0.0372, -0.1241],
          [-0.0583, -0.1042, -0.0469]],
          ...
          ...
          ...
          0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
        0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
        0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.]))])
1
2
3
4
5
6
7
8
9

加载：

vgg16 = torchvision.models.vgg16(pretrained=False)
vgg16.load_state_dict(torch.load("../model/vgg16_method2.pth"))
print(vgg16)
1
2
3

VGG(
  (features): Sequential(
    ...
  )
  (avgpool): AdaptiveAvgPool2d(output_size=(7, 7))
  (classifier): Sequential(
    ...
  )
)
1
2
3
4
5
6
7
8
9

2.3 Trap

当我们使用第一种方式保存自己定义的网络模型时：

class Liang(nn.Module):
    def __init__(self):
        super(Liang, self).__init__()
        self.conv1 = nn.Conv2d(3, 6, 5)

    def forward(self, x):
        x = self.conv1(x)
        return x


liang = Liang()
torch.save(liang, "../model/Liang.pth")
1
2
3
4
5
6
7
8
9
10
11
12

再使用第一种方式加载模型时：

liang = torch.load("../model/Liang.pth")
print(liang)
1
2

会报错：AttributeError: Can't get attribute 'Liang' on >

解决方法一：加上class类

class Liang(nn.Module):
    def __init__(self):
        super(Liang, self).__init__()
        self.conv1 = nn.Conv2d(3, 6, 5)

    def forward(self, x):
        x = self.conv1(x)
        return x


liang = torch.load("../model/Liang.pth")
print(liang)
1
2
3
4
5
6
7
8
9
10
11
12

Liang(
  (conv1): Conv2d(3, 6, kernel_size=(5, 5), stride=(1, 1))
)
1
2
3

解决方法二：引入

首先将class Liang类写入all_class.py 文件中，再使用 from all_class import *，直接引用！

from all_class import *

vgg16 = torchvision.models.vgg16(pretrained=False)
vgg16.load_state_dict(torch.load("../model/vgg16_method2.pth"))

liang = torch.load("../model/Liang.pth")
print(liang)
1
2
3
4
5
6
7

Liang(
  (conv1): Conv2d(3, 6, kernel_size=(5, 5), stride=(1, 1))
)
1
2
3

相关阅读:
护理教育学-简答题
 Spring中自定义类型转换器
 FPGA - 7系列 FPGA SelectIO -08- 高级逻辑资源之OSERDESE2
搭建私有Git服务器：GitLab部署详解
 在家中访问一个网站的思考
 WeetCode3 暴力递归->记忆化搜索->动态规划
 机器学习入门实战加州房价预测
 享元模式具体实例（含代码）
MinGW相关错误
 protobuf使用详解
原文地址：https://blog.csdn.net/m0_70885101/article/details/128026455