[GPT-3] Support Grad Merge with FP32 main grad for BF16 training of GPT-3 model #1053
Changes from 27 commits
5cf6743
52c7a83
840ce2f
2f9a039
2915f57
59546ba
86b574c
21760c0
c91472d
cd14fac
8a60a76
6bb1e03
cf7fa38
3012033
75b9a9f
58a337f
d239077
b8606bb
bc951b2
43023a5
a589a35
65d3d17
1ea9406
674e5fa
ba932a9
ea77797
325d81c
fa6fa48
1874cb6
62f32a9
974e1d2