-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
about the input/output size #8
Comments
Hi @Danee-wawawa,
This depends on several factors. In the simplest case -- if your data is images, and you would like to perform the translation between the images of the same size (e.g.
In a case, as I have described above, one would need to modify the data configuration of the training script. Taking uvcgan2/scripts/celeba_hq/train_m2f_translation.py Lines 63 to 76 in 8f4b1cb
One would need to modify If your case is more complicated, more modifications may be required. Please let me know if you have further questions. |
Thank you for your reply. Now,512 x 512 -> 512 x 512 is OK and I want to try 640 x 512 -> 640 x 512. Does this situation require modifying the network structure? |
No, you do not need to modify the network structure for With that said, it may be helpful to tune the network structure a bit to achieve the best performance, but it is not necessary. |
OK, thank you~~ |
你好@usert5432,
Sorry to bother you, I also have a question about image size. If my image shape is (3,512,512), then the
changed to
The following three lines:
Does it need to be changed accordingly to
Or what about other numbers? |
Hi @Pudding-0503, The data transformations are heavily dependent on the dataset that you have. For instance, if you have a large dataset (>= 5k images). And, if the objects that you want to translate have approximately the same size. Then, perhaps, you do not need to apply any transformations at all (or limit them just to a random horizontal flip). And, in general, I would suggest to start with only 'transform_train' : [
'random-flip-horizontal',
], And see how the translation works. If it does not work, then adjust the network hyperparameters (as described in the README). And, if it still does not work, then add new transformations. |
OK! I got it, thank you very much~~~ |
Hi @Danee-wawawa. Can you try switching the order of dimensions? That is, setting |
It is OK, thank you~ |
Hi @sophiatmu, Unfortunately, I do not think uvcgan will work on images of size |
Excuse me, is it true that higher resolution results in better image translation and higher metrics |
Hello @create-li
Unfortunately, we have not tried training on the higher resolution images, so I am not sure. Perhaps, somebody in this thread could provide feedback on higher resolution training. |
Hi,thank you for your work.
Does the input size and output size support other sizes, such as 640512 or 512512? If possible, where the code needs to be modified?
Looking forward to your answer.
The text was updated successfully, but these errors were encountered: