一、安装
1. 安装 tensorrt python 接口
https://developer.nvidia.com/nvidia-tensorrt-5x-download
tar xvf TensorRT-6.0.1.5.Ubuntu-18.04.x86_64-gnu.cuda-10.1.cudnn7.6.tar.gz
cd pythonpip install tensorrt-6.0.1.5-cp37-none-linux_x86_64.whl
cd uffpip install uff-0.6.5-py2.py3-none-any.whl
pythonimport tensorrt
2. 安装 torch2trt
sudo apt-get install libprotobuf* protobuf-compiler ninja-buildgit clone https://github.com/NVIDIA-AI-IOT/torch2trtcd torch2trtsudo python setup.py install --plugins
二、代码演示
model = BNNproAtt()
model.load_state_dict(torch.load('/src/2_toeng/pytorch_2_eng/reid2trt/BNNproAtt0525.pt', map_location = 'cpu')
model.eval().float().cuda()input_data = torch.rand((4, 3, 384, 128), dtype = torch.float).cuda()t0 = time.time()
out = model(input_data)
t1 = time.time()
print("pytorch costed time: ", t1 - t0)# convert to TensorRT model
model_trt = torch2trt(model, [input_data], max_batch_size = 4, int8_mode = True)t2 = time.time()
out_trt = model_trt(input_data)
t3 = time.time()
print("trt costed time: ", t3 - t2)# check the output against pytorch
print(torch.max(torch.abs(out - out_trt)torch.save(model_trt.state_dict(), '/src/2_toeng/pytroch_2_eng/reid2trt/bnn_trt_int8.pt'