[Bug]: stuck when running puma #443
Comments
Hi @hkz103, can you share your machine spec?
Linux Ubuntu 20.04, the GPU is an A100, and secretflow is built with Docker.
What about RAM? Puma needs around 1TB of RAM per node.
In your case, you were running three nodes on one machine, so you would need approximately 3TB of RAM.
If I run puma on gpt2 or other smaller models, do I need less RAM?
Yes, there is a gpt2 example; you can try that.
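The memory estimate in the comments above (roughly 1TB per node, so about 3TB for three nodes on one machine) can be checked before launching. The following is a minimal sketch, not part of SPU or puma: the helper names are hypothetical, the 1TB-per-node figure is the approximate number quoted in this thread, and "1TB" is assumed to mean 1 TiB. It reads physical RAM through POSIX `os.sysconf`, so it is meant for Linux hosts like the one reported here.

```python
import os

# Assumption: the "1TB" quoted in the thread is treated as 1 TiB.
TIB = 1024 ** 4


def required_ram_bytes(num_nodes: int, per_node_tib: float = 1.0) -> int:
    """Rough total RAM needed when all nodes run on the same machine.

    per_node_tib defaults to the approximate figure from this thread;
    it is an estimate, not a measured requirement.
    """
    if num_nodes < 1:
        raise ValueError("num_nodes must be at least 1")
    return int(num_nodes * per_node_tib * TIB)


def physical_ram_bytes() -> int:
    """Total physical RAM of this host, via POSIX sysconf (Linux)."""
    return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")


def has_enough_ram(total_bytes: int, num_nodes: int,
                   per_node_tib: float = 1.0) -> bool:
    """True if total_bytes covers the rough estimate for num_nodes."""
    return total_bytes >= required_ram_bytes(num_nodes, per_node_tib)
```

Calling `has_enough_ram(physical_ram_bytes(), 3)` on the host gives a quick yes/no for the three-node single-machine setup described above. Smaller models such as gpt2 would need a lower `per_node_tib`, but this thread does not give that figure.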
Issue Type
Usability
Modules Involved
Documentation/Tutorial/Example
Have you reproduced the bug with SPU HEAD?
Yes
Have you searched existing issues?
Yes
SPU Version
spu 0.7.0, secretflow 1.3.0
OS Platform and Distribution
Linux Ubuntu 20.04
Python Version
3.8
Compiler Version
gcc 10.5
Current Behavior?
When I run puma, the example gets stuck at:
"Run on CPU
Q: What is the largest animal?
A: The (no output)
Run on SPU"
(stuck)
Standalone code to reproduce the issue
Relevant log output
No response