library_name: transformers license: apache-2.0 datasets: - schneewolflabs/Athanor-DPO base_model: - nbeerbower/Schreiber-mistral-nemo-12B
Test model for Merlina. This is a simple 2 epoch ORPO run with beta=0.12.
beta=0.12