LLM-jp-4 8B Instruct (static Q4_K_M GGUF)

This is a static Q4_K_M quantized GGUF version of llm-jp/llm-jp-4-8b-instruct, converted and quantized using llama.cpp version 8740.

Note: The chat template has been slightly modified:

-<|start|>assistant
+<|start|>assistant<|channel|>final<|message|>

How to Use (llama-cli)

llama-cli -m llm-jp-4-8b-instruct_Q4_K_M.gguf -cnv -c 4096

Risks and Limitations

The models released here are in the early stages of our research and development and have not been tuned to ensure outputs align with human intent and safety considerations.

License

Apache License, Version 2.0 β€” same as the base model.

Downloads last month
70
GGUF
Model size
9B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for rinrin0413/llm-jp-4-8b-instruct-Q4_K_M-GGUF

Quantized
(3)
this model