-
Notifications
You must be signed in to change notification settings - Fork 8
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[perf] [refactor] Reduce kernel launch context construction overhead …
…(#3947) * Set external array launch context in C++ scope * Shorten execution path for ndarray in kernel_impl * Fix small problem on shape argument. * Auto Format * Fix naming and int ptr type convention * Add another args set method to reduce launch overhead. * Also check the annotations in the Ndarray shortcut. * Fix the ndarray_use_torch access method after merge with master branch. * Auto Format * Revert the shortcut loop in order to respect original branch logics. * Code formatting. * Revise func names and argument lists. Co-authored-by: Taichi Gardener <[email protected]>
- Loading branch information
1 parent
551af9c
commit ef6237a
Showing
4 changed files
with
44 additions
and
20 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters