Commit 3ef50ac
update cpp_trt code & refine docs
1 parent 69b4eaf

3 files changed: +47 -26 lines

deploy/cpp_infer/readme.md (+12 -4)
@@ -101,12 +101,12 @@ PaddleVideo model deployment.

 #### 1.2.1 Download and install directly

-* The [Paddle inference library official website](https://paddleinference.paddlepaddle.org.cn/v2.1/user_guides/download_lib.html) provides Linux inference libraries for different CUDA versions. Check the official website and **select the appropriate inference library version** (an inference library built from paddle >= 2.0.1 is recommended).
+* The [Paddle inference library official website](https://paddleinference.paddlepaddle.org.cn/v2.2/user_guides/download_lib.html) provides Linux inference libraries for different CUDA versions. Check the official website and **select the appropriate inference library version** (an inference library built from paddle >= 2.0.1 is recommended).

 * The download yields a `paddle_inference.tgz` archive; extract it into a folder with the following commands (taking a gcc8.2 machine environment as an example):

 ```bash
-wget https://paddle-inference-lib.bj.bcebos.com/2.1.1-gpu-cuda10.1-cudnn7-mkl-gcc8.2/paddle_inference.tgz
+wget https://paddle-inference-lib.bj.bcebos.com/2.2.2/cxx_c/Linux/GPU/x86-64_gcc8.2_avx_mkl_cuda10.1_cudnn7.6.5_trt6.0.1.5/paddle_inference.tgz
 tar -xf paddle_inference.tgz
 ```

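As a quick check that the downloaded library matches your environment, you can inspect the `version.txt` file that ships at the top of the extracted folder. This is a sketch: the exact field names vary across releases, so treat the `grep` pattern below as an assumption.

```bash
# paddle_inference/ is the folder produced by `tar -xf paddle_inference.tgz`.
# version.txt records the build configuration of the downloaded library,
# typically including the git commit id and the CUDA/cuDNN/TensorRT versions.
cat paddle_inference/version.txt

# Assumed field spellings; adjust the pattern if your release differs.
grep -Ei "cuda|cudnn|trt" paddle_inference/version.txt
```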

@@ -118,7 +118,7 @@ PaddleVideo model deployment.

 ```shell
 git clone https://github.com/PaddlePaddle/Paddle.git
-git checkout release/2.1
+git checkout release/2.2
 ```

 * After entering the Paddle directory, the compilation method is as follows.
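The compilation steps themselves fall outside this hunk. For orientation, here is a minimal sketch of the usual source build of the inference library; the cmake options shown are standard Paddle build flags, but verify them against the full readme before relying on them.

```bash
# Minimal sketch of building the Paddle inference library from source.
# Flag names are standard Paddle cmake options; confirm against the readme.
cd Paddle
mkdir -p build && cd build
cmake .. -DWITH_MKL=ON \
         -DWITH_GPU=ON \
         -DON_INFER=ON \
         -DWITH_PYTHON=OFF
make -j$(nproc)
# Packages headers and libs for the C++ inference distribution
make inference_lib_dist
```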
@@ -217,6 +217,14 @@ PaddleVideo model deployment.

 Here, `mode` is a required parameter indicating the selected function; its value range is ['rec'], which stands for **video recognition** (more functions will be added over time).

+Note: to enable the TensorRT optimization option at inference time, first run the following commands to set the relevant CUDA and TensorRT paths.
+```bash
+export PATH=$PATH:/path/to/cuda/bin
+export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/path/to/cuda/lib64
+export LIBRARY_PATH=$LIBRARY_PATH:/path/to/cuda/lib64
+export LD_LIBRARY_PATH=/path/to/TensorRT-x.x.x.x/lib:$LD_LIBRARY_PATH
+```
+
 ##### 1. Call video recognition:
 ```bash
 # Run PP-TSM recognition
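Before running inference, it is worth confirming that the exported paths actually resolve. A small sketch, where `/path/to/TensorRT-x.x.x.x` stands for your real install directory:

```bash
# Print each entry of the dynamic-library search path on its own line
echo $LD_LIBRARY_PATH | tr ':' '\n'

# The core TensorRT runtime library should be visible under the lib directory
ls /path/to/TensorRT-x.x.x.x/lib/libnvinfer.so*
```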
@@ -292,4 +300,4 @@ I1125 08:10:45.834602 13955 autolog.h:67] preprocess_time(ms): 10.6524, inferenc

 ### 3 Attention

-* When using the Paddle inference library, version 2.1.0 is recommended.
+* When using the Paddle inference library, version 2.2.2 is recommended.

deploy/cpp_infer/readme_en.md (+13 -4)
@@ -101,12 +101,12 @@ There are two ways to obtain the Paddle prediction library, which will be descri

 #### 1.2.1 Download and install directly

-* The [Paddle prediction library official website](https://paddleinference.paddlepaddle.org.cn/v2.1/user_guides/download_lib.html) provides Linux prediction libraries for different cuda versions; you can check the official website and **select the appropriate prediction library version** (it is recommended to select a prediction library with paddle version >= 2.0.1).
+* The [Paddle prediction library official website](https://paddleinference.paddlepaddle.org.cn/v2.2/user_guides/download_lib.html) provides Linux prediction libraries for different cuda versions; you can check the official website and **select the appropriate prediction library version** (it is recommended to select a prediction library with paddle version >= 2.0.1).

 * Download the `paddle_inference.tgz` compressed package and unzip it into a folder with the following commands (taking a gcc8.2 machine environment as an example):

 ```bash
-wget https://paddle-inference-lib.bj.bcebos.com/2.1.1-gpu-cuda10.1-cudnn7-mkl-gcc8.2/paddle_inference.tgz
+wget https://paddle-inference-lib.bj.bcebos.com/2.2.2/cxx_c/Linux/GPU/x86-64_gcc8.2_avx_mkl_cuda10.1_cudnn7.6.5_trt6.0.1.5/paddle_inference.tgz
 tar -xf paddle_inference.tgz
 ```


@@ -118,7 +118,7 @@ There are two ways to obtain the Paddle prediction library, which will be descri

 ```shell
 git clone https://github.com/PaddlePaddle/Paddle.git
-git checkout release/2.1
+git checkout release/2.2
 ```

 * After entering the Paddle directory, the compilation method is as follows.
@@ -217,7 +217,16 @@ Operation mode:

 Among them, `mode` is a required parameter indicating the selected function; its value range is ['rec'], which stands for **video recognition** (more functions will be added over time).

+Note: if you want to enable the TensorRT optimization option during prediction, you need to run the following commands to set the relevant CUDA and TensorRT paths.
+```bash
+export PATH=$PATH:/path/to/cuda/bin
+export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/path/to/cuda/lib64
+export LIBRARY_PATH=$LIBRARY_PATH:/path/to/cuda/lib64
+export LD_LIBRARY_PATH=/path/to/TensorRT-x.x.x.x/lib:$LD_LIBRARY_PATH
+```
+
 ##### 1. Call video recognition:
+
 ```bash
 # run PP-TSM inference
 ./build/ppvideo rec \
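With the paths set, TensorRT is toggled when the binary is invoked. The flags below are illustrative assumptions rather than confirmed options of this tool; list the options your build actually accepts (for example via `./build/ppvideo --help`) before copying them.

```bash
# Hypothetical PP-TSM run with TensorRT enabled; the flag names
# (--rec_model_dir, --video_dir, --use_gpu, --use_tensorrt) are assumptions.
./build/ppvideo rec \
    --rec_model_dir=./inference/ppTSM \
    --video_dir=./example_video_dir \
    --use_gpu=true \
    --use_tensorrt=true
```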
@@ -291,4 +300,4 @@ I1125 08:10:45.834602 13955 autolog.h:67] preprocess_time(ms): 10.6524, inferenc

 ### 3 Attention

-* When using the Paddle prediction library, it is recommended to use version 2.1.0 of the prediction library.
+* When using the Paddle prediction library, it is recommended to use version 2.2.2 of the prediction library.

deploy/cpp_infer/src/video_rec.cpp (+22 -18)
@@ -227,7 +227,7 @@ namespace PaddleVideo
                 config.EnableTensorRtEngine(
                     1 << 30, // workspaceSize
                     this->rec_batch_num * this->num_seg * 1, // maxBatchSize
-                    30, // minSubgraphSize
+                    3, // minSubgraphSize
                     precision, // precision
                     false, // useStatic
                     false // useCalibMode
@@ -238,7 +238,7 @@ namespace PaddleVideo
                 config.EnableTensorRtEngine(
                     1 << 30, // workspaceSize
                     this->rec_batch_num * this->num_seg * 10, // maxBatchSize
-                    30, // minSubgraphSize
+                    3, // minSubgraphSize
                     precision, // precision
                     false, // useStatic
                     false // useCalibMode
@@ -249,29 +249,33 @@ namespace PaddleVideo
                 config.EnableTensorRtEngine(
                     1 << 30, // workspaceSize
                     this->rec_batch_num, // maxBatchSize
-                    30, // minSubgraphSize
+                    3, // minSubgraphSize
                     precision, // precision
                     false, // useStatic
                     false // useCalibMode
                 );
             }

-            std::map<std::string, std::vector<int> > min_input_shape =
-            {
-                {"data_batch_0", {1, this->num_seg, 3, 1, 1}}
-            };
-            std::map<std::string, std::vector<int> > max_input_shape =
-            {
-                {"data_batch_0", {1, this->num_seg, 3, 256, 256}}
-            };
-            std::map<std::string, std::vector<int> > opt_input_shape =
-            {
-                {"data_batch_0", {this->rec_batch_num, this->num_seg, 3, 224, 224}}
-            };
+            std::cout << "Enable TensorRT is: " << config.tensorrt_engine_enabled() << std::endl;

-            config.SetTRTDynamicShapeInfo(min_input_shape, max_input_shape,
-                                          opt_input_shape);
-            std::cout << "Enable TensorRT is: " << config.tensorrt_engine_enabled() << std::endl; }
+            /* Some models do not support dynamic shape with TRT, so it is disabled by default. */
+
+            // std::map<std::string, std::vector<int> > min_input_shape =
+            // {
+            //     {"data_batch_0", {1, this->num_seg, 3, 1, 1}}
+            // };
+            // std::map<std::string, std::vector<int> > max_input_shape =
+            // {
+            //     {"data_batch_0", {1, this->num_seg, 3, 256, 256}}
+            // };
+            // std::map<std::string, std::vector<int> > opt_input_shape =
+            // {
+            //     {"data_batch_0", {this->rec_batch_num, this->num_seg, 3, 224, 224}}
+            // };
+
+            // config.SetTRTDynamicShapeInfo(min_input_shape, max_input_shape,
+            //                               opt_input_shape);
+        }
     }
     else
     {
