Try theta_2 cuda upload to use Triple sum #422
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
The following errors may occur when using the "Triple sum" function.
The problem is,
code in "mergers.py", 418 line
To solve this, I add simple check code for theta_2
This method increases GPU memory usage by putting theta_2 on the GPU, but it succeeds if the GPU memory capacity is sufficient.
In my testing, I can do Triplesum on an RTX3080Ti with 12GB of VRAM. But the memory usage is pretty close.
Please review my commit and I hope it will help you.
Added: Fix exception handling
Updated the exception handling from except NameError to the more general except Exception to handle 2 model merge
to