Model Card

Model Description

This is a 1B-parameter Large Language Model (LLM), fine-tuned from Llama 3.2 1B on a subset of the dataset "mlabonne/orpo-dpo-mix-40k". Weights are distributed as F32 safetensors.

Evaluation Results

HellaSwag

| Metric   | Value  |
|----------|--------|
| Accuracy | 0.4517 |
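A HellaSwag accuracy like the one above can in principle be reproduced with EleutherAI's lm-evaluation-harness. This is a sketch only: the harness version, few-shot setting, and batch size used for the reported number are not stated in this card, so the flags below are illustrative assumptions.

```shell
pip install lm-eval
lm_eval --model hf \
  --model_args pretrained=d4niel92/llama-3.2-1B-orpo \
  --tasks hellaswag \
  --batch_size 8
```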

How to Use

The model can be loaded directly from the Hugging Face Hub with the `transformers` library.
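A minimal loading and generation sketch, assuming `transformers` and a PyTorch backend are installed (the prompt and generation settings are illustrative, not from this card):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Repository ID of this model on the Hugging Face Hub.
model_id = "d4niel92/llama-3.2-1B-orpo"

# Download (or reuse the cached) tokenizer and weights.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Example prompt; replace with your own input.
prompt = "The capital of France is"
inputs = tokenizer(prompt, return_tensors="pt")

# Greedy generation of up to 50 new tokens.
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```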
