Skip to content

b4771

Compare
Choose a tag to compare
@github-actions github-actions released this 25 Feb 10:21
3e9a286
llama : expose llama_model_n_head_kv in the API (#11997)

It's useful to be able to have this from the library layer as it's a key
parameter of the model (e.g. to figure out how much KV cache memory is
needed).