Add optimization to dynamo-based exporter #1541

titaiwangms · 2025-01-08T00:52:54Z

Describe your changes

Checklist before requesting a review

Add unit tests for this change.
Make sure all tests can pass.
Update documents if necessary.
Lint and apply fixes to your code by running lintrunner -a
Is this a user-facing change? If yes, give a description of this change to be included in the release notes.
Is this PR including examples changes? If yes, please remember to update example documentation in a follow-up PR.

(Optional) Issue link

justinchuby · 2025-01-08T06:15:12Z

LGTM, cc @jambayk for review

xadupre · 2025-01-08T11:01:15Z

olive/passes/onnx/conversion.py

@@ -106,6 +106,14 @@ def _default_config(cls, accelerator_spec: AcceleratorSpec) -> Dict[str, PassCon
            "dynamic": PassConfigParam(
                type_=bool, default_value=True, description=("Whether to export the model with dynamic axes/shapes.")
            ),
+            "optimization": PassConfigParam(


Does this optimization include onnxruntime contrib ops as well, or just standard ONNX?

I think it's this one, so no. https://github.com/microsoft/onnxscript/blob/5e7b0e4b626023d9b726b3c8b8336698f1ac4537/onnxscript/optimizer/_optimizer.py#L29

cc @gramalingam

We will probably need to support both. Should we have two flags? Or make this an enumeration instead of a bool, so we can decide whether to apply ORT-specific optimizations too? It depends on the intended use of this Olive pass: is it intended only for ORT users?
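To make the bool-versus-enumeration question concrete, here is a minimal sketch of what an enumerated option could look like. `OptimizationLevel` and `planned_steps` are hypothetical names invented for illustration; they are not part of Olive, onnxscript, or the PyTorch exporter, and the three levels are only one possible reading of the suggestion.

```python
from enum import Enum


class OptimizationLevel(str, Enum):
    """Hypothetical replacement for a boolean `optimization` flag."""

    NONE = "none"        # skip optimization entirely
    GENERIC = "generic"  # standard-ONNX rewrites only
    ORT = "ort"          # generic rewrites plus ORT-specific fusions


def planned_steps(level):
    """Return the optimization stages implied by a level (enum member or string)."""
    level = OptimizationLevel(level)  # raises ValueError on unknown strings
    if level is OptimizationLevel.NONE:
        return []
    steps = ["generic"]
    if level is OptimizationLevel.ORT:
        # Assumption: ORT-specific fusions run after the generic ones.
        steps.append("ort_specific")
    return steps
```

A single enumerated option avoids having two flags whose combinations need to be validated (for example, ORT-specific fusions requested while generic optimization is off).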

The current design in PyTorch lets users decide whether to optimize or not. This flag simply follows that idea. I personally think adding another optimization flag for ORT is confusing and complicated. If we really want ORT optimization to be an option, I suggest we change the PyTorch exporter to apply regular optimization by default, and have the optimize() method apply ORT-specific optimization.

Having optimize target ORT in PyTorch is not ideal. We could expose a target option in the optimize API. However, depending on what Olive wants to provide to its users, I think a similar string option is reasonable. It could default to ORT, similar to what model builder does right now, although there are pushes for MB to be more generic.

Yes, that [responding to Titai's message above] is reasonable: specifically, let's forget PyTorch here and focus on Olive users. The Olive pass can internally always do generic optimization; there is no need to expose it to the user as an option at all. We could expose ORT optimization as an option, but even that is necessary only if we expect users to use Olive for both ORT and non-ORT targets. It is unclear if we need that. That's why I am asking about the users of this Olive pass: can we assume it is going to be only for ORT users? (We can't make that assumption for the PyTorch exporter.)

I think even for ORT users, contrib operators are not supported for all target use cases. For instance, NPU targets which require QDQ models might not support any contrib operators.

What do you use as user input to make the decisions? Some combination of (EP, dtype)? As long as we have the necessary information to select the appropriate fusion optimizations, we should be fine.

In passes like OrtTransformersOptimization, we make some decisions on what optimizations to enable/disable based on the accelerator spec which includes the target device (cpu, gpu, npu) and EP. dtype is already a config option for this conversion pass so that info is also available if needed.
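As a rough illustration of deciding from the accelerator spec rather than from a user-facing flag, the sketch below uses a hypothetical `TargetSpec` dataclass as a stand-in for Olive's `AcceleratorSpec`. The rule it encodes (no contrib-op fusions for NPU targets or when no EP is set) is an assumption drawn from the NPU/QDQ concern raised in this thread, not Olive's actual behavior.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class TargetSpec:
    """Hypothetical stand-in for Olive's AcceleratorSpec (device plus EP)."""

    device: str                               # "cpu", "gpu", or "npu"
    execution_provider: Optional[str] = None  # None means ORT is not the target


def allow_contrib_fusions(spec: TargetSpec) -> bool:
    """Decide whether ORT contrib-op fusions may be applied for this target."""
    if spec.execution_provider is None:
        # Not targeting ORT: keep the model to standard ONNX ops.
        return False
    # Assumption: NPU targets need QDQ models and may not support contrib ops.
    return spec.device.lower() != "npu"
```

For example, `allow_contrib_fusions(TargetSpec("gpu", "CUDAExecutionProvider"))` would permit the fusions, while an NPU target would fall back to generic optimization only. A fuller version could also take `dtype` into account, since it is already a config option of the conversion pass.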

add optimization

ca557af

titaiwangms requested a review from justinchuby January 8, 2025 00:54

justinchuby requested a review from jambayk January 8, 2025 06:14

xadupre reviewed Jan 8, 2025


Review thread on olive/passes/onnx/conversion.py, comment authors in order (all Jan 8, 2025): xadupre, titaiwangms, titaiwangms, gramalingam, titaiwangms, justinchuby, gramalingam (edited), jambayk, gramalingam, jambayk (edited)
