
Conversation

@kctezcan (Contributor) commented Sep 30, 2025

Description

This is an additional PR on top of a previous PR: #961

The previous one introduced a new function to embed cells for the targets. This PR instead reuses the existing embed_cells() function to embed the target tokens, in order to reduce duplicated code and avoid the two code paths drifting apart over time ("code rot").

I have tested both training and inference with this.
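For readers of the thread, a minimal sketch of the refactor in spirit; the function and argument names below are illustrative only and may differ from the actual code:

# Before this PR: a dedicated helper duplicated the cell-embedding logic for targets
# target_embeds = self.embed_target_cells(target_tokens)

# After this PR: the existing embed_cells() is reused for both source and target tokens
source_embeds = self.embed_cells(source_tokens)
target_embeds = self.embed_cells(target_tokens)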

Issue Number

Closes #941

Checklist before asking for review

  • I have performed a self-review of my code
  • My changes comply with basic sanity checks:
    • I have fixed formatting issues with ./scripts/actions.sh lint
    • I have run unit tests with ./scripts/actions.sh unit-test
    • I have documented my code and I have updated the docstrings.
    • I have added unit tests, if relevant
  • I have tried my changes with data and code:
    • I have run the integration tests with ./scripts/actions.sh integration-test
    • (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
    • (bigger changes and experiments) I have shared a HedgeDoc in the GitHub issue with all the configurations and runs for these experiments
  • I have informed and aligned with people impacted by my change:
    • for config changes: the MatterMost channels and/or a design doc
    • for changes of dependencies: the MatterMost software development channel

time_win: tuple,
normalizer, # dataset
use_normalizer: str, # "source_normalizer" or "target_normalizer"
Contributor

Rename use_normalizer to channel_to_normalize. Even though the type and possible values are clearly documented, use_normalizer suggests a boolean value.
Another option is to rename normalizer to normaliser_dataset or normaliser_ds, so you can use normalizer instead of use_normalizer.
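For illustration, the two options would look roughly like this (the function name and surrounding signature are paraphrased from the snippet above, not the exact code):

import numpy as np

# Option 1: rename the selector so it no longer reads like a boolean flag
def get_tokens(
    times: np.ndarray,
    time_win: tuple,
    normalizer,                  # dataset
    channel_to_normalize: str,   # "source_normalizer" or "target_normalizer"
): ...

# Option 2: rename the dataset argument, freeing the name `normalizer` for the selector
def get_tokens(
    times: np.ndarray,
    time_win: tuple,
    normalizer_ds,               # dataset
    normalizer: str,             # "source_normalizer" or "target_normalizer"
): ...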

)
for stl_b in batch
]
)
Contributor

Use fewer lines; the current version looks more complex than it actually is. For example:

target_source_like_tokens_lens = torch.stack([
    torch.stack([
        torch.stack([
            s.target_source_like_tokens_lens[fstep]
            if len(s.target_source_like_tokens_lens[fstep]) > 0
            else torch.tensor([])
            for fstep in range(len(s.target_source_like_tokens_lens))
        ]) for s in stl_b
    ]) for stl_b in batch
])

If this was caused by ruff then just forget about this comment...

for ib, sb in enumerate(batch):
    for itype, s in enumerate(sb):
        for fstep in range(offsets.shape[0]):
            if target_source_like_tokens_lens[ib, itype, fstep].sum() != 0:  # if not empty
Contributor

Replace if target_source_like_tokens_lens[ib, itype, fstep].sum() != 0: with if any(target_source_like_tokens_lens[ib, itype, fstep]): for better efficiency.
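As a small, self-contained sanity check of the equivalence (assuming, as here, that the entries are non-negative token lengths):

import torch

lens = torch.tensor([0, 3, 0, 5])            # non-negative per-token lengths
empty = torch.zeros(4, dtype=torch.int64)

assert bool(lens.sum() != 0) == bool(lens.any()) == any(lens)      # all True
assert bool(empty.sum() != 0) == bool(empty.any()) == any(empty)   # all False

# any(...) / .any() can stop at the first nonzero element, whereas .sum()
# always reduces over the whole tensor. On CUDA tensors the built-in any()
# iterates element by element, so tensor.any() is usually preferable there.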

# batch sample list when non-empty
for fstep in range(len(self.target_source_like_tokens_cells)):
    if (
        torch.tensor([len(s) for s in self.target_source_like_tokens_cells[fstep]]).sum()
Contributor

Replace

if (
   torch.tensor([len(s) for s in self.target_source_like_tokens_cells[fstep]]).sum()
   > 0
):

with

if any(len(s) > 0 for s in self.target_source_like_tokens_cells[fstep]):

for slightly better efficiency.

Maybe you can also find a way to do the check in constant time, without computing len(s) for every element and without having to write multiple lines of code.
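One possible direction (purely a sketch; the attribute name and the construction site are hypothetical): record non-emptiness once when the per-fstep cell lists are built, so the later check becomes a constant-time lookup:

# wherever target_source_like_tokens_cells is populated:
self.fstep_has_cells = [
    any(len(s) > 0 for s in cells) for cells in self.target_source_like_tokens_cells
]

# later, the per-fstep check is O(1):
for fstep in range(len(self.target_source_like_tokens_cells)):
    if self.fstep_has_cells[fstep]:
        ...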

times: np.array,
time_win: tuple,
normalizer, # dataset
use_normalizer: str, # "source_normalizer" or "target_normalizer"
Contributor

Rename use_normalizer here as well, as suggested for tokeniser_forecast.py (see first comment).

tokens_target_det = tokens_target.detach() # explicitly detach as well
tokens_targets.append(tokens_target_det)

return_dict = {"preds_all": preds_all, "posteriors": posteriors}
Contributor

Initialize return_dict above the first if check on "encode_targets_latent".
Move the key accesses on return_dict to the end of the first if check on "encode_targets_latent".
Remove the second if check on "encode_targets_latent".
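A rough sketch of the suggested control flow (names and keys are paraphrased and may differ from the actual code):

return_dict = {"preds_all": preds_all, "posteriors": posteriors}

if encode_targets_latent:  # stands in for the actual config check
    # ... encode the targets, detach, append to tokens_targets ...
    return_dict["tokens_targets"] = tokens_targets  # key access moved into this branch

# the second check on "encode_targets_latent" is then no longer needed
return return_dict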

# # we don't append an empty tensor for the source
# tokens_all.append(torch.tensor([], dtype=self.dtype, device="cuda"))
# el
if source_tokens_lens.sum() != 0:
Contributor

Replace if source_tokens_lens.sum() != 0: with if source_tokens_lens.any(): for better efficiency

@github-project-automation github-project-automation bot moved this to In Progress in WeatherGen-dev Oct 15, 2025

Labels

None yet

Projects

Status: In Progress

Development

Successfully merging this pull request may close these issues.

encoding target variables in the latent space

2 participants