SJTU SLURM Skill
by @taleintervenor
Log in to the SJTU HPC platform (also known as "交我算") as the user to perform job queries, submissions, cancellations, and data management. Use this skill whe...
1. Ensure HPC API token file is available in the workspace, which should be stored in credentials directory. If not, request a new token.
2. For each user request, analyze whether you need to log in to an HPC entry node to perform the operation remotely. You can ask more questions to clarify any ambiguous parts of the request.
- If user wants to know its storage quota usage or update its account (like password, binding Email/jAccount, preferred contact method), then you can meet the requirement directly by calling HPC API with the token.
- if user is talking about job or its data, then you have to select an entry node to do remote operation. In this case, take the following steps.
- if user want to get a passwordless certificate for SSH login, refer to certificates section.
3. Ensure SSH keys and certificates are available in the workspace, which should be stored in credentials directory. If not, request a new SSH certificate for the user. Remind the user that requesting a certificate will trigger two-factor authentication.
4. For each user request need remote operation, identify the node group/partition the user is interested in and the operation type to select the correct entry node.
5. Use the SSH certificate to connect to the corresponding entry node based on the cluster and operation type, execute the user's requested operations on it.
clawhub install sjtu-slurm-skill