Falied to acquire lock on file #523
-
Hi, I am having some trouble saving a checkpoint. My code is very simple: PyTreeCheckpointer().save(dir_ / 'actor_params', controller._actor_params) and works fine on my local machine. However, when running on an HPC cluster, I get the following error:
There are no access problems i.e. checkpointing with |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 1 reply
-
This issue might be related to https://github.com/google/tensorstore. Can you please try their discussion forum? |
Beta Was this translation helpful? Give feedback.
-
I have the same problem on hpc server. Have you fixed it? |
Beta Was this translation helpful? Give feedback.
-
Please consider reporting it to https://github.com/google/tensorstore/issues directly. |
Beta Was this translation helpful? Give feedback.
Please consider reporting it to https://github.com/google/tensorstore/issues directly.