ModelPruning callback - hard pruning #19347
Unanswered
ilya-SX
asked this question in
code help: CV
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi all,
I am performing a structural pruning using the pruning callback (
pytorch_lightning.callbacks.ModelPruning
) and since it performs soft pruning only (replacing weights with zeros) I see almost no difference in model latency when converted to ONNX. It seems that to reduce model latency hard pruning (removing zero weights) should be performed.My questions are:
Thanks
Beta Was this translation helpful? Give feedback.
All reactions