ModelPruning callback - hard pruning #19347

ilya-SX · 2024-01-25T14:39:15Z

ilya-SX
Jan 25, 2024

Hi all,

I am performing a structural pruning using the pruning callback (pytorch_lightning.callbacks.ModelPruning) and since it performs soft pruning only (replacing weights with zeros) I see almost no difference in model latency when converted to ONNX. It seems that to reduce model latency hard pruning (removing zero weights) should be performed.

My questions are:

Is it possible to perform hard pruning using this callback?
What is the goal of soft pruning (besides slight latency reduction and dropout-like effect)? What is the point of using this callback if it doesn't actually reduce latency (the goal of pruning)?

Thanks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ModelPruning callback - hard pruning #19347

{{title}}

Replies: 0 comments

Select a reply

ModelPruning callback - hard pruning #19347

ilya-SX Jan 25, 2024

Replies: 0 comments

ilya-SX
Jan 25, 2024