Experimental interface for torch ops · alihassanijr/NATTEN-Torch@18ecf32

Commit

Experimental interface for torch ops

See SHI-Labs#184

Only supports forward pass for now, due to current limitations of
registering custom ops with torch compared to autograd functions. Some
of those limitations are:

* No stable interface for supporting autocasting to fp16/bf16,
  * Gradient scaling doesn't seem to be supported either, leading to
    training instability.

* Ops cannot indicate that they expect contiguous operands, and need to
  call `.contiguous()` within, and this incurs additional tensor copy
  costs, and brings down throughput (in some cases it's hard to even
  tell the difference between compiled and eager.)

Loading branch information

alihassanijr committed Dec 10, 2024

1 parent ee45f9d commit 18ecf32

0 comments on commit `18ecf32`

Please sign in to comment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit

There are no files selected for viewing

0 comments on commit `18ecf32`

Commit

There are no files selected for viewing

0 comments on commit 18ecf32

0 comments on commit `18ecf32`