TFMC/imatrix-dataset-for-japanese-llm
This model is a quantized version of DeepSeek-R1-Distill-Qwen-14B, built with the imatrix dataset TFMC/imatrix-dataset-for-japanese-llm. The imatrix data is a mix of English and Japanese text, so the quantization is tuned for Japanese.
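To illustrate why the choice of imatrix dataset matters, here is a minimal toy sketch of importance-weighted quantization. It is not the actual llama.cpp algorithm. It assumes the importance values stand in for per-channel activation statistics gathered from calibration text, and all numbers and function names (`quantize`, `weighted_error`, `best_scale`) are made up for illustration.

```python
def quantize(weights, scale, lo=-8, hi=7):
    """Round each weight to the nearest signed 4-bit level, then map it back to a float."""
    return [scale * max(lo, min(hi, round(w / scale))) for w in weights]


def weighted_error(weights, importance, scale):
    """Sum of squared rounding errors, each weighted by its channel importance."""
    dequantized = quantize(weights, scale)
    return sum(i * (w - d) ** 2 for w, d, i in zip(weights, dequantized, importance))


def best_scale(weights, importance, steps=200):
    """Grid-search the scale that minimizes the importance-weighted error."""
    max_abs = max(abs(w) for w in weights)
    candidates = [max_abs / 8 * (0.5 + k / steps) for k in range(steps + 1)]
    return min(candidates, key=lambda s: weighted_error(weights, importance, s))


# Toy data (hypothetical): one row of weights and per-channel importance values.
weights = [0.91, -0.13, 0.42, -0.77, 0.05, 0.30, -0.58, 0.66]
uniform = [1.0] * len(weights)                          # no calibration data
importance = [0.1, 9.0, 0.2, 0.1, 8.0, 0.3, 0.1, 0.2]   # made-up calibration statistics

scale_plain = best_scale(weights, uniform)
scale_imatrix = best_scale(weights, importance)

# Score both scales by the error on the channels the calibration text cares about.
err_plain = weighted_error(weights, importance, scale_plain)
err_imatrix = weighted_error(weights, importance, scale_imatrix)
print(f"weighted error, plain scale:   {err_plain:.6f}")
print(f"weighted error, imatrix scale: {err_imatrix:.6f}")
```

In this sketch the scale picked with importance values never gives a larger weighted error than the one picked without them, because both are drawn from the same candidate grid. Calibrating on mixed English/Japanese text is meant to shift that weighting toward the channels Japanese input exercises.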
This code repository and the model weights are licensed under the MIT License. The DeepSeek-R1 series supports commercial use and allows any modifications and derivative works, including, but not limited to, distillation for training other LLMs. Please note that:
Quantization: 4-bit
Base model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B