daniellepintz commented Oct 10, 2025

Adding a config to run SFT on Qwen3 8B. This requires some updates to tokenizer.py.

Test Plan:

python -m apps.sft_v2.main --config apps/sft_v2/qwen3_8b.yaml
python -m apps.sft_v2.main --config apps/sft_v2/llama3_8b.yaml

meta-cla bot added the CLA Signed label Oct 10, 2025
daniellepintz marked this pull request as ready for review October 13, 2025 18:50

ebsmothers left a comment

Thanks for adding this! One small nit but lgtm

Comment on lines +4 to +5
# TODO: required by torchtitan
# https://github.com/pytorch/torchtitan/blob/2f1c814da071cc8ad165d00be6f9c1a66f8e1cce/torchtitan/distributed/utils.py#L265

can remove this now?

daniellepintz Oct 13, 2025

daniellepintz merged commit b7d2526 into main Oct 13, 2025
6 checks passed
daniellepintz deleted the daniellepintz-dev branch October 13, 2025 20:40