Skip to content

The padding of I3D model should be symmetrical #32

@fjchange

Description

@fjchange

The Module of MaxPool3dTFPadding with kernel_size=(1,3,3), stride(1,2,2) can lead to asymmetrical padding.
It would influence the output feature map, as the bottom right would be usually higher than other part of the feature map.

When I try to input a all zeros tensor into I3D model pretrained on Kinetics-400, someting strange happen, I average pooling the C and T dim and min-max norm to get a picture as below. The bottom right is much higher than other parts.
grad_cam

By checking each layer output, I find out bottom right is usually have higher activation value than other part but not obvious, until mixed_5b block.

I don't know whether it hurt the model's performance, but at least it hurt the Interpretability.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions