Skip to content

Commit

Permalink
feat(aio): switch to llama3-based for LLM (#2225)
Browse files Browse the repository at this point in the history
Signed-off-by: mudler <[email protected]>
  • Loading branch information
mudler authored May 2, 2024
1 parent 2cc1bd8 commit a31d00d
Show file tree
Hide file tree
Showing 3 changed files with 3 additions and 3 deletions.
2 changes: 1 addition & 1 deletion aio/cpu/text-to-text.yaml
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
name: gpt-4
mmap: true
parameters:
model: huggingface://NousResearch/Hermes-2-Pro-Mistral-7B-GGUF/Hermes-2-Pro-Mistral-7B.Q2_K.gguf
model: huggingface://NousResearch/Hermes-2-Pro-Llama-3-8B-GGUF/Hermes-2-Pro-Llama-3-8B-Q4_K_M.gguf

template:
chat_message: |
Expand Down
2 changes: 1 addition & 1 deletion aio/gpu-8g/text-to-text.yaml
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
name: gpt-4
mmap: true
parameters:
model: huggingface://NousResearch/Hermes-2-Pro-Mistral-7B-GGUF/Hermes-2-Pro-Mistral-7B.Q6_K.gguf
model: huggingface://NousResearch/Hermes-2-Pro-Llama-3-8B-GGUF/Hermes-2-Pro-Llama-3-8B-Q4_K_M.gguf

template:
chat_message: |
Expand Down
2 changes: 1 addition & 1 deletion aio/intel/text-to-text.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@ name: gpt-4
mmap: false
f16: false
parameters:
model: huggingface://NousResearch/Hermes-2-Pro-Mistral-7B-GGUF/Hermes-2-Pro-Mistral-7B.Q6_K.gguf
model: huggingface://NousResearch/Hermes-2-Pro-Llama-3-8B-GGUF/Hermes-2-Pro-Llama-3-8B-Q4_K_M.gguf

template:
chat_message: |
Expand Down

0 comments on commit a31d00d

Please sign in to comment.