Text Generation
Transformers
Safetensors
English
mistral
code
sft
rl
rlvr
grpo
text-generation-inference

RenCoder-Devstral-Small-2507

This model is a SFT + RLVR (DPO+GRPO) version of mistralai/Devstral-Small-2507 on muliple agentic coding datasets (SWE-Bench, etc).

"Obsessed with building Open Source AGI, So am I ! Let's create together 🚀 https://www.linkedin.com/in/pankajam"

Model Details

Usage

License

This model inherits the Apache 2.0 license from the base Devstral-Small-2507 model.

Acknowledgements

Downloads last month
111
Safetensors
Model size
24B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for pankajmathur/RenCoder-Devstral-Small-2507

Datasets used to train pankajmathur/RenCoder-Devstral-Small-2507