Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

train.py #13

Open
hhb1224 opened this issue May 19, 2021 · 2 comments
Open

train.py #13

hhb1224 opened this issue May 19, 2021 · 2 comments

Comments

@hhb1224
Copy link

hhb1224 commented May 19, 2021

Hi, Thank you for approaching me efficient pruning method.

I implemented 1 of 2 pruning steps(Training & Masked retraining). While I train with train.py, 4 batch size, and 25 epoch, It implements 0-24 steps and again and again.. When will it stop itself? and what does this iteration means?

@nightsnack
Copy link
Owner

Hi hhb1224,
The first step includes 4 iterations of rho (in cfg/darknet_admm.yaml: --rou-num) each iteration uses a different rho value. Starting from 0.0001, the rou value increases by 10x each iteration, which means the punishment of pruning loss increases by 10x each iteration. For more details of admm algorithms please check this paper: A Systematic DNN Weight Pruning Framework using Alternating Direction Method of Multipliers https://arxiv.org/pdf/1804.03294.pdf

@baisong666
Copy link

我第一步训练完后生成的4个pt文件都是256M,这正常吗,第二步最后生成的模型文件竟然有500M那末大,请问为什幺呢,谢谢拉

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants