• Linux ubuntu 服务器部署详细教程


    系统环境:ubuntu20.04

    安装显卡驱动:

    1. 驱动下载,我选择的是520.56.06
    2. 禁用nouveau驱动
    echo "blacklist nouveau">>/etc/modprobe.d/blacklist.conf
    echo "options nouveau modeset=0">>/etc/modprobe.d/blacklist.conf
    sudo update-initramfs -u
    
    • 1
    • 2
    • 3
    1. 重启按crtl+alt+f2键进入终端模式:安装以下依赖
    sudo apt-get install -y gcc make build-essential
    
    • 1
    1. 安装驱动
      执行权限
    sudo chmod a+x NVIDIA-Linux-x86_64-xxx.xx.run
    
    • 1

    运行

    sudo ./NVIDIA-Linux-x86_64-xxx.xx.run  -no-x-check -no-nouveau-check -no-opengl-files
    
    • 1
    1. 安装过程选项
      默认既可,但要Secure Boot(安全启动)为Disabled 或者关闭

    安装cuda与duann

    1. cuda下载,我选择的是11.8
    sudo ./cuda_xxx_linux.run
    
    • 1

    请注意取消选择安装驱动第一个x去掉,x代表选中
    2. 配置cuda路径

    sudo vim ~/.bashrc
    
    • 1

    添加如下内容

    $ export PATH=/usr/local/cuda/bin${PATH:+:${PATH}}
    $ export LD_LIBRARY_PATH=/usr/local/cuda/lib64\
                             ${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}
    
    • 1
    • 2
    • 3
    source ~/.bashrc
    
    • 1
    1. cudnn安装,我选择的是Local Installer for Linux x86_64 (Tar)
      然后解压代码包:
    tar -xf xx.archive.tar.xz
    
    • 1

    复制到对应文件夹下

    sudo cp include/cudnn.h /usr/local/cuda/include
    sudo cp lib/libcudnn* /usr/local/cuda/lib64
    sudo chmod a+r /usr/local/cuda/include/cudnn.h 
    sudo chmod a+r /usr/local/cuda/lib64/libcudnn*
    
    • 1
    • 2
    • 3
    • 4

    安装docker-gpu

    1. 安装docker
    sudo apt-get remove docker docker-engine docker.io containerd runc
    sudo apt-get update
    sudo apt-get install \
        ca-certificates \
        curl \
        gnupg \
        lsb-release
    sudo mkdir -p /etc/apt/keyrings
    curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo gpg --dearmor -o /etc/apt/keyrings/docker.gpg
    echo \
      "deb [arch=$(dpkg --print-architecture) signed-by=/etc/apt/keyrings/docker.gpg] https://download.docker.com/linux/ubuntu \
      $(lsb_release -cs) stable" | sudo tee /etc/apt/sources.list.d/docker.list > /dev/null
    sudo apt-get update
    sudo chmod a+r /etc/apt/keyrings/docker.gpg
    sudo apt-get update
    sudo apt-get install docker-ce docker-ce-cli containerd.io docker-compose-plugin
    sudo docker run hello-world
    
    • 1
    • 2
    • 3
    • 4
    • 5
    • 6
    • 7
    • 8
    • 9
    • 10
    • 11
    • 12
    • 13
    • 14
    • 15
    • 16
    • 17

    设置权限问题
    报:Got permission denied while trying to connect to the Docker daemon socket at

    sudo gpasswd -a nvidia docker
    newgrp docker
    
    • 1
    • 2
    1. 安装docker-gpu
    curl https://get.docker.com | sh \
      && sudo systemctl --now enable docker
    distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
          && curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
          && curl -s -L https://nvidia.github.io/libnvidia-container/$distribution/libnvidia-container.list | \
                sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
                sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
    
    sudo apt-get update
    sudo apt-get install -y nvidia-docker2
    sudo systemctl restart docker
    sudo docker run --rm --gpus all nvidia/cuda:11.0.3-base-ubuntu20.04 nvidia-smi
    
    • 1
    • 2
    • 3
    • 4
    • 5
    • 6
    • 7
    • 8
    • 9
    • 10
    • 11
    • 12
    1. 设置私有仓库
    vim /etc/docker/daemon.json
    
    • 1

    将以下内容写入

    {
        "insecure-registries":["http://xx.xxx.xx.xx:xxxx"]
    }
    
    • 1
    • 2
    • 3

    再次读入配置信息,并启动

    sudo systemctl daemon-reload
    sudo systemctl restart docker
    
    • 1
    • 2

    安装ssh、ifconfig、vim常用工具

    sudo apt install -y net-tools vim
    sudo apt install -y --no-install-recommends openssh-server
    service ssh start
    
    • 1
    • 2
    • 3

    参考文献

    1. 显卡驱动:
      https://blog.csdn.net/weixin_42341590/article/details/109563412
    2. cuda+cudnn
      https://blog.csdn.net/h3c4lenovo/article/details/119003405
    3. docker gpu
      https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html
      https://docs.docker.com/engine/install/ubuntu/
  • 相关阅读:
    SessionManagementConfigurer和SecurityContextConfigurer
    Redis数据持久化(详解+样例)
    天宇优配|多家房企发布再融资预案,最牛地产股九连板
    哈夫曼树实现哈夫曼编码(C++)
    区块链系统Docker&Kuberntes一键部署
    源码编译安装与yum和rpm软件安装详解
    Linux》yum与vim
    【面试题】 ES6知识点详细解析
    8.for循环
    Hadoop提交MR任务
  • 原文地址:https://blog.csdn.net/weixin_44003563/article/details/127616631