Commit a32ea43

Update tokenshift_transformer.md
1 parent 735e834 commit a32ea43

1 file changed: +4 -4 lines changed

docs/en/model_zoo/recognition/tokenshift_transformer.md

Lines changed: 4 additions & 4 deletions
@@ -1,6 +1,6 @@
 [简体中文](../../../zh-CN/model_zoo/recognition/tokenshift_transformer.md) | English
 
-# TimeSformer
+# Token Shift Transformer
 
 ## Content
 
@@ -45,7 +45,7 @@ UCF-101 data download and preparation please refer to [UCF-101 data preparation]
 MODEL:
     framework: "RecognizerTransformer"
     backbone:
-        name: "VisionTransformer"
+        name: "TokenShiftVisionTransformer"
         pretrained: fill in the path here
 ```
 
@@ -75,7 +75,7 @@ python3 main.py --amp -c configs/recognition/token_transformer/tokShift_transfor
 Already save the best model (top1 acc)0.9201
 ```
 
-- Since the sampling method of the Token Shift Transformer model test mode is **uniform** sampling, which is different from the **dense** sampling used in the verification mode during the training process, so the verification index recorded in the training log is `topk Acc `Does not represent the final test score, so after the training is completed, you can use the test mode to test the best model to obtain the final index, the command is as follows:
+- Since the sampling method of the Token Shift Transformer model test mode is **uniform** sampling, which is different from the **dense** sampling used in the verification mode during the training process, so the verification index recorded in the training log, called `topk Acc `, does not represent the final test score, so after the training is completed, you can use the test mode to test the best model to obtain the final index, the command is as follows:
 
 ```bash
 python3 main.py --amp -c configs/recognition/token_transformer/tokShift_transformer_ucf101_256_videos.yaml --test --seed=1234 -w 'output/TokenShiftVisionTransformer/TokenShiftVisionTransformer_best.pdparams'
@@ -118,7 +118,7 @@ Current video file: data/BrushingTeeth.avi
 top-1 score: 0.9959074258804321
 ```
 
-It can be seen that using the TimeSformer model trained on Kinetics-400 to predict `data/BrushingTeeth.avi`, the output top1 category id is `19`, and the confidence is 0.99. By consulting the category id and name correspondence table, it can be seen that the predicted category name is `brushing_teeth`.
+It can be seen that using the Token Shift Transformer model trained on UCF-101 to predict `data/BrushingTeeth.avi`, the output top1 category id is `19`, and the confidence is 0.99. By consulting the category id and name correspondence table, it can be seen that the predicted category name is `brushing_teeth`.
 
 ## Reference
 