Addressing cat empty tensor case. Fixes gpt2 data distributed example #3866
base: main
Conversation
for i, each_input in enumerate(input):
    if isinstance(each_input, torch.Tensor) and each_input.numel() == 0:
        logger.warning(
            f"Warning: empty tensor in cat input {i}, replacing with zeros"
        )
Can you make this warning much more specific? Print information like the current node and, if you can, where in the graph it comes from, because users will not understand what you mean by this. Also, where is the "replacing with zeros"?
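A hypothetical sketch of what a more descriptive warning could look like; the helper name, its signature, and the idea of passing the FX node's name in are illustrative assumptions, not the PR's actual code:

import logging
from typing import Any, List, Sequence

import torch

logger = logging.getLogger(__name__)


def filter_empty_cat_inputs(node_name: str, inputs: Sequence[Any]) -> List[Any]:
    # Drop constant torch.Tensor inputs that are empty, reporting exactly
    # which node and which input position was affected.
    kept = []
    for i, each_input in enumerate(inputs):
        if isinstance(each_input, torch.Tensor) and each_input.numel() == 0:
            logger.warning(
                f"aten.cat node '{node_name}': input {i} is an empty tensor "
                f"(numel() == 0); dropping it from the concatenation since "
                f"it contributes no elements."
            )
            continue
        kept.append(each_input)
    return kept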
Also, if this is caught by the validator, should this be an error? Will conversion fail, or can we just ignore it?
Thanks for pointing out the error. I was earlier replacing with zeros, but later changed to continue, since replacing with zeros is not required. I will change the warning comment.
The difference between this and the validator is that if the empty tensor is a torch.Tensor, we can handle it in the converter, whereas if the empty tensor comes as an ITensor input to the converter, TensorRT complains. (I was trying to implement it earlier by replacing it with zeros, but that still leads to the error [RemoveDeadLayers] Input Tensor y is unused or used only at compile-time, but is not being removed.)
To illustrate the difference:
This will pass:

def test_cat_with_empty_tensor(self, _, dim):
    # Concat where the empty tensor is a constant created inside forward
    class Cat(nn.Module):
        def forward(self, x):
            y = torch.empty(0, 2, 3, device="cuda")
            return torch.ops.aten.cat.default((x, y), dim)

    inputs = [
        torch.randn(1, 2, 3, device="cuda"),
    ]
    self.run_test(Cat(), inputs)
This will fail:

def test_cat_with_empty_tensor(self, _, dim):
    # Concat where the empty tensor arrives as a graph input
    class Cat(nn.Module):
        def forward(self, x, y):
            return torch.ops.aten.cat.default((x, y), dim)

    inputs = [
        torch.randn(1, 2, 3, device="cuda"),
        torch.empty(0, 2, 3, device="cuda"),
    ]
    self.run_test(Cat(), inputs)
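(The second case fails because y is a module input, so by conversion time it reaches the converter as a TensorRT ITensor rather than a constant torch.Tensor that the converter can simply drop.)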
return input_tensors, dim

def cat_validator(node: Node, settings: Optional[CompilationSettings] = None) -> bool:
I don't really understand this condition. So if we have a TRT ITensor that has a 0 in any dimension, then we should break the graph? I don't think any of these ITensors will be available at validation time, since validation is run prior to partitioning.
Should we be checking for empty PyTorch tensors?
Yes, ideally. The validation here would be based on the ITensor shape, which isn't available at validation time, so yes, it should use the node metadata.
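A minimal sketch of such a metadata-based check, assuming the usual exported-FX convention that each input node carries a FakeTensor under node.meta["val"]; the helper name and that assumption are illustrative and should be verified against what is actually populated at validation time:

import torch
from torch.fx.node import Node


def cat_has_statically_empty_input(node: Node) -> bool:
    # Inspect the first positional arg of aten.cat (the sequence of
    # tensors) and report whether any of them is known to be empty from
    # graph metadata alone, before any ITensors exist.
    tensor_args = node.args[0]
    for inp in tensor_args:
        if not isinstance(inp, Node):
            continue
        val = inp.meta.get("val", None)  # assumed FakeTensor metadata
        if isinstance(val, torch.Tensor) and val.numel() == 0:
            return True
    return False

A validator could then return False when this reports True, so that the cat falls back to PyTorch instead of being converted.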
But then this won't distinguish between the ITensor and torch.Tensor cases.
Fixes #3865