Optimize the insert function of the LayerList class #71540

Qin-sx · 2025-03-10T15:03:52Z

PR Category

User Experience

PR Types

Improvements

Description

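Optimize the insert function of the LayerList class.
Corresponding issue

When validating index, the insert function does not handle the case where _sub_layers is empty, in which the accepted range collapses to [0, 0).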

    assert isinstance(index, int) and -len(self._sub_layers) <= index < len(
        self._sub_layers
    ), f"index should be an integer in range [{-len(self)}, {len(self)})"
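To make the failure concrete, here is a minimal sketch of the check above in plain Python. `index_is_valid` is a hypothetical helper, not a Paddle function; it only mirrors the condition inside the assert.

```python
def index_is_valid(index, length):
    """Mirror the original half-open check: -length <= index < length."""
    return isinstance(index, int) and -length <= index < length


# With an empty container the range is [0, 0), which contains no integer,
# so even index 0 is rejected.
empty_zero = index_is_valid(0, 0)
one_zero = index_is_valid(0, 1)
one_end = index_is_valid(1, 1)

print(empty_zero)  # False
print(one_zero)    # True
print(one_end)     # False: inserting at the end is rejected too
```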

PyTorch performs no such check and simply inserts the new layer.

    def insert(self, index: int, module: Module) -> None:
        r"""Insert a given module before a given index in the list.

        Args:
            index (int): index to insert.
            module (nn.Module): module to insert
        """
        for i in range(len(self._modules), index, -1):
            self._modules[str(i)] = self._modules[str(i - 1)]
        self._modules[str(index)] = module
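The shifting loop can be traced on a plain dict. The sketch below is an illustration only: `shift_insert` is a hypothetical stand-in for the method above, and strings stand in for modules.

```python
def shift_insert(modules, index, module):
    """Replay the loop above on a dict keyed by str(position)."""
    # Walk from the end down to index + 1, moving each entry one slot right.
    for i in range(len(modules), index, -1):
        modules[str(i)] = modules[str(i - 1)]
    modules[str(index)] = module


modules = {"0": "a", "1": "b", "2": "c"}
shift_insert(modules, 1, "x")
ordered = [modules[str(i)] for i in range(len(modules))]
print(ordered)  # ['a', 'x', 'b', 'c']
```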

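This change makes the accepted insert range closed on both ends, e.g. [0, 1] or [0, 0]. After the change:

A new layer can be inserted at index=0 when _sub_layers is empty.
A new layer can be inserted at position len(self).

Corresponding tests have been added.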


SigureMo · 2025-03-10T15:08:58Z

test/sot/test_15_slice.py

@@ -147,5 +147,26 @@ def test_string_slice(self):
        self.assert_results(string_slice, x)


+class TestLayerListEmptyInsert(unittest.TestCase):


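Is this related to SOT? Why add it here?

Hello, I searched the test folder, and it seems only SOT has tests involving the LayerList class.

Then add a new unit test of your own under test/legacy_test. The name of this test file alone makes it obvious that this is not the right place.

Understood, fixed.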

new file: test/legacy_test/test_layerlist.py modified: test/sot/test_15_slice.py

zhwesky2010 · 2025-03-12T04:05:24Z

python/paddle/nn/layer/container.py

+                )
+            self.append(sublayer)
+            return
+


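The assert below can be moved up here to form a single overall assert. Also, can this avoid the extra branch and share the same implementation as the code below?

Understood, fixed. The _get_abs_idx function did not handle the index=0 case either, so I changed it as well.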

modified: python/paddle/nn/layer/container.py modified: test/legacy_test/test_layerlist.py

zhwesky2010 · 2025-03-13T08:52:00Z

python/paddle/nn/layer/container.py

@@ -545,9 +545,11 @@ def __init__(self, sublayers: Iterable[Layer] | None = None) -> None:

    def _get_abs_idx(self, idx: int) -> int:
        if isinstance(idx, int):
-            if not (-len(self) <= idx < len(self)):
+            if not (len(self) == 0 and idx == 0) and not (


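Isn't this because the original bound was unreasonable? It was [-len, len), closed on the left and open on the right, and should become [-len, len]; inserting at position len means inserting at the very end. Please check whether torch is designed this way.

In other words, add a new feature: insertion at position len, not just for len equal to 0 but as a general capability.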

pytorch

PyTorch does not appear to handle this case.
I added a print statement to PyTorch's insert function:

    def insert(self, index: int, module: Module) -> None:
        r"""Insert a given module before a given index in the list.

        Args:
            index (int): index to insert.
            module (nn.Module): module to insert
        """
        print("inner Pytorch ModuleList insert function: ", self._modules)
        for i in range(len(self._modules), index, -1):
            self._modules[str(i)] = self._modules[str(i - 1)]
        self._modules[str(index)] = module

and ran the following test code:

    import torch

    modules = torch.nn.ModuleList()
    modules.insert(2, torch.nn.Linear(20, 20))
    print("outer modules:", modules)
    modules.insert(3, torch.nn.Linear(30, 30))
    print("outer modules:", modules)
    modules.insert(0, torch.nn.Linear(10, 10))
    print("outer modules:", modules)

The test code raised an error, presumably because the str(i) keys derived from the input index are not restricted.

    inner Pytorch ModuleList insert function: OrderedDict()
    outer modules: ModuleList(
      (0): Linear(in_features=20, out_features=20, bias=True)
    )
    inner Pytorch ModuleList insert function: OrderedDict([('2', Linear(in_features=20, out_features=20, bias=True))])
    outer modules: ModuleList(
      (0): Linear(in_features=20, out_features=20, bias=True)
      (1): Linear(in_features=30, out_features=30, bias=True)
    )
    inner Pytorch ModuleList insert function: OrderedDict([('2', Linear(in_features=20, out_features=20, bias=True)), ('3', Linear(in_features=30, out_features=30, bias=True))])
    Traceback (most recent call last):
      File "/home/qinsx/paddle_test/Modulelist/modulelist.py", line 11, in <module>
        modules.insert(0, torch.nn.Linear(10, 10))
      File "/home/qinsx/pyenv/pdbaddbmm/lib/python3.8/site-packages/torch/nn/modules/container.py", line 377, in insert
        self._modules[str(i)] = self._modules[str(i - 1)]
    KeyError: '1'
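The traceback can be reproduced without torch by replaying the same loop on a plain dict. This is a sketch under that assumption: `shift_insert` is a hypothetical stand-in for the insert method quoted above, and strings stand in for the Linear layers.

```python
def shift_insert(modules, index, module):
    """Same loop as the insert method quoted above, on a plain dict."""
    for i in range(len(modules), index, -1):
        modules[str(i)] = modules[str(i - 1)]
    modules[str(index)] = module


modules = {}
shift_insert(modules, 2, "linear20")  # loop body never runs; key '2' is created
shift_insert(modules, 3, "linear30")  # loop body never runs; key '3' is created
keys_after_two_inserts = sorted(modules)
print(keys_after_two_inserts)  # ['2', '3']: keys '0' and '1' do not exist

try:
    # len is 2, so the loop first reads modules['1'], which was never written.
    shift_insert(modules, 0, "linear10")
    error_key = None
except KeyError as exc:
    error_key = exc.args[0]
print(error_key)  # prints 1
```

The first two calls succeed only because the loop range is empty; they leave gaps at keys '0' and '1', and the third call is the first one that reads a missing key.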

paddle

In Paddle, the insert function calls _get_abs_idx to obtain the index.

    def _get_abs_idx(self, idx: int) -> int:
        if isinstance(idx, int):
            if not (len(self) == 0 and idx == 0) and not (
                -len(self) <= idx < len(self)
            ):
                raise IndexError(
                    f'index {idx} is out of range, should be 0 for empty list or in range [{-len(self)}, {len(self)})'
                )
            if idx < 0:
                idx += len(self)
        return idx

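_get_abs_idx also enforces the half-open range (closed on the left, open on the right). I think _get_abs_idx should stay half-open: if other functions call it and receive the right endpoint, they would go out of bounds. The special case I added returns idx=0 when the length is 0.
I see two possible options:

Option 1: keep this change, i.e. add the length-0 special case to both _get_abs_idx and insert.

Option 2: leave _get_abs_idx unchanged. insert does not call _get_abs_idx and instead performs its own check modeled on _get_abs_idx.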
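The distinction behind the two options mirrors Python's built-in list, where a lookup position must lie in [-len, len) but an insert position may also equal len. A short illustration with a plain list (not Paddle code):

```python
items = ["a", "b"]

# Lookup at len(items) is out of range.
try:
    items[len(items)]
    lookup_ok = True
except IndexError:
    lookup_ok = False
print(lookup_ok)  # False

# Insert at len(items) is valid and appends.
items.insert(len(items), "c")
print(items)  # ['a', 'b', 'c']

# Insert at 0 into an empty list is valid as well.
empty = []
empty.insert(0, "x")
print(empty)  # ['x']
```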

Code for option 2

    assert isinstance(index, int) and -len(self._sub_layers) <= index <= len(
        self._sub_layers
    ), f"index should be an integer in range [{-len(self)}, {len(self)}]"
    if index < 0:
        index += len(self)
    for i in range(len(self._sub_layers), index, -1):
        self._sub_layers[str(i)] = self._sub_layers[str(i - 1)]
    self._sub_layers[str(index)] = sublayer
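A self-contained sketch of option 2 follows, so the closed range can be exercised without Paddle. `TinyLayerList` is a hypothetical stand-in that keeps only a `_sub_layers` dict and is not the actual LayerList implementation; strings stand in for layers.

```python
from collections import OrderedDict


class TinyLayerList:
    """Hypothetical minimal container; only models insert and ordering."""

    def __init__(self):
        self._sub_layers = OrderedDict()

    def __len__(self):
        return len(self._sub_layers)

    def insert(self, index, sublayer):
        # Closed range: index == len(self) means "append at the end".
        assert isinstance(index, int) and -len(self) <= index <= len(self), (
            f"index should be an integer in range [{-len(self)}, {len(self)}]"
        )
        if index < 0:
            index += len(self)
        # Shift entries at positions >= index one slot to the right.
        for i in range(len(self), index, -1):
            self._sub_layers[str(i)] = self._sub_layers[str(i - 1)]
        self._sub_layers[str(index)] = sublayer

    def to_list(self):
        return [self._sub_layers[str(i)] for i in range(len(self))]


layers = TinyLayerList()
layers.insert(0, "a")    # empty container, index 0
layers.insert(1, "c")    # index == len(layers): append
layers.insert(1, "b")    # middle insert
layers.insert(-1, "x")   # negative index: before the last element
print(layers.to_list())  # ['a', 'b', 'x', 'c']
```

Because the check rejects anything outside [-len, len], every key read inside the loop already exists, which avoids the KeyError seen when indices are left unchecked.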

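@Qin-sx I may not have described it clearly. The new feature means insertion at position len(self): for a list of length 0, insert at position 0; for a list of length 1, insert at position 1.

I tried torch, and it supports this.

@Qin-sx Please go with option 2. If _get_abs_idx cannot be changed to a closed range, then just don't call it.

Understood, fixed. I also added tests for the new insertion at position len(self).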

modified: python/paddle/nn/layer/container.py modified: test/legacy_test/test_layerlist.py

zhwesky2010

LGTM


Qin-sx and others added 15 commits December 5, 2024 06:36

added reset peak value initialization

e2299f9

modified: paddle/fluid/pybind/pybind.cc modified: paddle/phi/core/memory/stats.cc modified: paddle/phi/core/memory/stats.h modified: python/paddle/device/cuda/__init__.py

added comments

dc30348

modified: paddle/fluid/pybind/pybind.cc modified: python/paddle/device/cuda/__init__.py

added cpp tests

d081dde

modified: paddle/fluid/pybind/pybind.cc modified: paddle/phi/core/memory/stats.cc modified: paddle/phi/core/memory/stats.h modified: test/cpp/fluid/memory/stats_test.cc

added python tests

35a12c8

new file: test/legacy_test/test_cuda_memory_stats.py new file: test/legacy_test/test_cuda_reset_peak_memory_stats.py

added a python test for reset_max_memory_allocated

cb5036c

new file: test/legacy_test/test_cuda_reset_max_memory_allocated.py

formatted by pre-commit

5d28856

modified: python/paddle/device/cuda/__init__.py modified: test/legacy_test/test_cuda_memory_stats.py modified: test/legacy_test/test_cuda_reset_max_memory_allocated.py modified: test/legacy_test/test_cuda_reset_peak_memory_stats.py

formatted by pre-commit (clang-format)

f6b3d84

modified: paddle/fluid/pybind/pybind.cc modified: paddle/phi/core/memory/stats.cc modified: paddle/phi/core/memory/stats.h modified: test/cpp/fluid/memory/stats_test.cc

added reset max memory reserved function

b6ea9be

modified: python/paddle/device/cuda/__init__.py modified: test/legacy_test/test_cuda_reset_max_memory_allocated.py new file: test/legacy_test/test_cuda_reset_max_memory_reserved.py

deleted memory stats and reset peak memory stats

caad368

modified: paddle/fluid/pybind/pybind.cc modified: python/paddle/device/cuda/__init__.py deleted: test/legacy_test/test_cuda_memory_stats.py deleted: test/legacy_test/test_cuda_reset_peak_memory_stats.py

Merge branch 'develop' of https://github.com/Qin-sx/Paddle into develop

299a7c0

Merge branch 'PaddlePaddle:develop' into develop

306df76

Merge branch 'PaddlePaddle:develop' into develop

968dc8b

Merge branch 'PaddlePaddle:develop' into develop

8aa753c

Merge branch 'PaddlePaddle:develop' into develop

b6e43d6

optimized insert function in LayerList class

585d00b

modified: python/paddle/nn/layer/container.py modified: test/sot/test_15_slice.py

Qin-sx requested review from SigureMo, zrr1999 and gouzil as code owners March 10, 2025 15:03

SigureMo reviewed Mar 10, 2025


paddle-bot bot added the contributor External developers label Mar 10, 2025

removed the test file

63445db

new file: test/legacy_test/test_layerlist.py modified: test/sot/test_15_slice.py

zhwesky2010 reviewed Mar 12, 2025


Qin-sx added 2 commits March 12, 2025 16:31

Merge branch 'develop' into opt-LayerList

18cc524

optimized conditions

104bbed

modified: python/paddle/nn/layer/container.py modified: test/legacy_test/test_layerlist.py

zhwesky2010 reviewed Mar 13, 2025


Qin-sx added 2 commits March 17, 2025 22:27

restored _get_abs_idx and modified insert

98a45b8

modified: python/paddle/nn/layer/container.py modified: test/legacy_test/test_layerlist.py

Merge branch 'develop' into opt-LayerList

b53a97f

zhwesky2010 approved these changes Mar 19, 2025


zhwesky2010 merged commit 212d828 into PaddlePaddle:develop Mar 19, 2025
30 checks passed

zhwesky2010 mentioned this pull request Mar 27, 2025

paddle.nn.LayerList has a problem: insert fails when the length is 0 #71224

Closed
