BUG:AttributeError: ‘GLMChineseTokenizer‘ object has no attribute ‘sp_model’

BUG:AttributeError: ‘GLMChineseTokenizer‘ object has no attribute ‘sp_model’
BUG:AttributeError: ‘GLMChineseTokenizer’ object has no attribute 'sp_model’

环境
```
Python 3.10
torch 2.0.1
transformers 4.37.0
```
详情

在运行 glm-large-chinese 模型时弹出的BUG，具体原因不清楚，大概是 transformers 版本改变了，导致一些接口导入方式改变，而glm-large-chinese 的代码还是旧版的。

解决方法

打开模型附带的 tokenization_glm.py 代码文件。修改 GLMChineseTokenizer 类初始化。
```
# 原始
def __init__(self, vocab_file, **kwargs):
    super().__init__(**kwargs)  # 置后
    self.vocab_file = vocab_file
    self.sp_model = spm.SentencePieceProcessor()
    self.sp_model.Load(vocab_file)

# 修改
def __init__(self, vocab_file, **kwargs):
    self.vocab_file = vocab_file
    self.sp_model = spm.SentencePieceProcessor()
    self.sp_model.Load(vocab_file)
    super().__init__(**kwargs)  # 置后
```
参考

https://github.com/baichuan-inc/Baichuan2/issues/204

解决‘BaichuanTokenizer‘ object has no attribute ‘sp_model‘，无需重装transformers和torch_baichuantokenizer’ obiect has no attribute’sp mode-CSDN博客
相关阅读:
2022年C等级考试九月一级真题B：成绩判定
 【图论】树链剖分
 二叉树与堆
 Oracle绑定SQL执行计划
 站长号词库：今日热门长尾关键词挖掘 20221201
区块链与云计算的融合：新时代数据安全的挑战与机遇
 C语言变量与常量
 seacms_CNVD-2020-22721_v10.1漏洞分析与复现
 C++ 11 新玩法
 Self-supervised Low Light Image Enhancement and Denoising 论文阅读笔记
原文地址：https://blog.csdn.net/qq_38463737/article/details/139731888

BUG:AttributeError: ‘GLMChineseTokenizer’ object has no attribute 'sp_model’

环境

详情

解决方法

参考