Open
Description
I am working on the reproduction of this paper, and I found that using the baseline model to train the NTU dataset with RGB+Depth two modalities, the highest accuracy I achieved was acc==77, I don’t know why.
In addition, my data set reading speed is very slow.
I wonder if the author or other friends can give me some training experience and suggestions.
I used the dataset downloaded from the official website and used the script to extract frames from the RGB modality videos, 16 frames per sample.
I made very few changes to the source code, and the parameters are basically the default parameters of the source code.
Thank you very much.
Metadata
Metadata
Assignees
Labels
No labels