You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I train a CNN model with say 40,000,000 parameters it's way faster than a mamba model with 721,079 params.
Is it usual or something wrong on my side?
I tried it on a desktop 4090 and a server with A100. The output is the same for both of them.
The text was updated successfully, but these errors were encountered:
I train a CNN model with say 40,000,000 parameters it's way faster than a mamba model with 721,079 params.
Is it usual or something wrong on my side?
I tried it on a desktop 4090 and a server with A100. The output is the same for both of them.
The text was updated successfully, but these errors were encountered: