You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
current BN implementation results in slower performance. We should perform BN constant folding manually and using scaleND layer.
To Reproduce
In ResNet 18, we notice that the performance gap between Torch-TensorRT and ONNX TensorRT is around 15%. In Torch-TensorRT engine graph, we did not find the CONV layer and BN layer are fused together. This could be the main cause of the performance gap.