ResNet structure

Hi,

The original tensorflow implementation uses the standard structure for the first convolution layer, i.e., 7x7 kernel size, stride 2, padding 3 and a 3x3 max pooling layer after that [(link)](https://github.com/google-research/meta-dataset/blob/main/meta_dataset/models/functional_backbones.py#L717) while in your implementation this layer is used with 3x3 kernel size and without max pooling [(link)](https://github.com/mboudiaf/pytorch-meta-dataset/blob/master/src/models/standard/resnet.py#L90). In this way the resulted feature map is way larger and costs more memory. I also notice that in the PAMI version of TIM the authors claim that the pytorch version of baselines are much better than the original version. I wonder if the performance boost comes from this modification. The 'larger' version of resnet seems not so practical for meta-dataset, since it will lead to OOM when being trained with the ProtoNet or other episodic methods. I don't know if I have any misunderstanding about the code.

 Thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ResNet structure #13

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

ResNet structure #13

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions