Skip to content

Commit bca8c27

Browse files
yadaishBernard-Liu
authored andcommitted
fix performance issuse
1 parent 1f82f3b commit bca8c27

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

fbgemm_gpu/codegen/training/backward/embedding_backward_split_template.cu

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1425,6 +1425,7 @@ Tensor {{ embedding_cuda_op }}(
14251425
32,
14261426
false>;
14271427
blockSize = dim3(32, num_warp_per_row_groups);
1428+
// printf("%s:%d warp kernel %d\n", __FILE__, __LINE__, num_warp_per_row_groups);
14281429
}
14291430
}
14301431
{%- endif %}

0 commit comments

Comments
 (0)