Pull requests: HabanaAI/vllm-hpu-extension
Enables multinode calibration flow through calibrate_model.sh
#92 opened Feb 7, 2025 by vishnumadhu365
Add renormalize parameter for FusedMOE's & modify experts_max arg of mixture_of_experts()
#70 opened Jan 9, 2025 by tangleintel
[WIP] Add option to do group sum on TPC instead of MME
#64 opened Dec 20, 2024 by mswiniarsk (Draft)