Replies: 2 comments
-
Any resolution on this ? |
Beta Was this translation helpful? Give feedback.
0 replies
-
@dhingratul, can you share full stack trace? I suspect the problem is that partitioned parameters are not gathered before access:
Such problems are often solved using the parameter gathering context manager like |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello, deep speed users
I'm trying to learn from github code (https://github.com/arcee-ai/DistillKit/blob/main/distil_hidden.py) using Zero3 settings, but I get an error called "RuntimeError: 'weight must be 2-D'.
Can you help me?
config
Beta Was this translation helpful? Give feedback.
All reactions