Deploy and test SLURM setup #1
Comments
@n8layman, when @espirado has you test this, look into:
@n8layman @collinschwantes I have updated the README for testing purposes. Could you test it and open issues for your findings and suggestions?
Per my conversation with @espirado today, I won't be able to fork the repo and test the container on my local machine because of incompatibilities with the ARM architecture. We're working on setting up a VM so I can test the SLURM workflow remotely.
Since the code change to deploy the SLURM-based reservoir is huge, I will be creating a repository for the new code, separate from eha-server.
Deployed the first attempt on Aegypti. I got the containers to run, but because of hardware incompatibilities with the GPU and most drivers, I had to remove the GPU. Hopefully I will have usable access tomorrow.
@n8layman, can we schedule a test session for SLURM? I have it set up on Aegypti. Can you first try to access via
Sounds great. I can access the controller using ssh as above.
We successfully ran a test on the base SLURM environment, and it worked as expected. We will proceed with creating a repo with adequate code examples for integrating various R/Python workflows with SLURM, covering different types of workloads and efficient cluster usage. @n8layman will also assist in coming up with examples that we can use for M3.
Excellent!
Notes from review
All issues raised will be addressed in the next merge request, along with documentation for both the workflow examples and the infrastructure deployment.
Added all nodes (Prospero, Sycorax, Aegypti). Tests for targets are working.