cuda - FindHao

Pytorch Benchmark笔记

安装 GitHub的安装教程已经很完善，这里不再赘述。 Update mode more ...

CUDA Tips[持续更新]

1. 设置多GPU环境下GPU的可见性如果服务器上有多个GPU，可以设置程序只用某几个GPU。 # for NVIDIA GPUs export CUDA_VISIBLE_DEVICES=0 more ...

NVIDIA Jetson 配置笔记

安装pytorch 现在nvidia官方已经提供了简单的安装命令。 https://docs.nvidia.com/deeplearning/frameworks/install-pytorch-jetson-platform/index.html 是nvidia为jetson系列专门定制的pytorch。但是其他的包还是需要自己编译安 more ...

nsight compute和nsight system的使用笔记

使用ncu和nsys cli的笔记，持续更新。 Nsight Compute ncu主要是获取更细粒度的intra kernel的hardware counters。官方手册官方的profile 指导手册 more ...

大部分情况下，更新nvidia gpu驱动不需要重启机器。如果你的驱动成功更新，但是使用nvidia-smi提示有Failed to initialize NVML: Driver/library version mismatch，一般情况下是因为更新的驱动没有被成功加载。查看当前nvidia driver是否被使用执行第二条命令可以直接列出正在使用gpu的程序。比如nv-hosten是DCGM的server端，直接kill或者使用nv-hostengine -t将其退出即可 more ...

NVIDIA DCGM

Introduction NVIDIA Data Center GPU Manager (DCGM) is a suite of tools for managing and monitoring NVIDIA datacenter GPUs in cluster environments. It also provides APIs to let developers integrate it into their own GPU profiling/monitoring tools. Installation If you have more ...

Pytorch源码编译

Pytorch Benchmark笔记

CUDA Tips[持续更新]

NVIDIA Jetson 配置笔记

nsight compute和nsight system的使用笔记

不重启服务器重新挂载nvidia gpu driver

NVIDIA DCGM

ubuntu nvidia gpu driver的安装

使用GVProf测试Python程序

nvidia docker笔记