Qwen3.5-4B-MiniFantasy: GGUF


GGUF quantizations of Nubinu/Qwen3.5-4B-MiniFantasy, a 4-bit LoRA fine-tune of MuXodious/Qwen3.5-4B-SOMPOA-heresy-v2, focused on multi-turn narrative roleplay, emotional resonance, and character-driven storytelling.


Available Quants

| Quantization | Size |
|--------------|----------|
| Q4_K_M | ~2.71 GB |
| Q5_K_M | ~3.07 GB |
| Q6_K_M | ~3.46 GB |
| Q8_0 | ~4.48 GB |
| F16 | ~8 GB |
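
As a rough aid for choosing a file, the sketch below picks the largest quant whose size fits a given memory budget. The sizes are copied from the list above; the `pick_quant` helper and its `overhead_gb` default (headroom for context and runtime buffers) are illustrative assumptions, not measured values for this model.

```python
# Approximate file sizes in GB, copied from the quant list above.
QUANT_SIZES_GB = {
    "Q4_K_M": 2.71,
    "Q5_K_M": 3.07,
    "Q6_K_M": 3.46,
    "Q8_0": 4.48,
    "F16": 8.0,
}


def pick_quant(memory_gb, overhead_gb=1.5):
    """Return the largest quant that fits in memory_gb, or None if none fit.

    overhead_gb is an assumed allowance for the KV cache and runtime
    buffers; tune it for your context length and backend.
    """
    budget = memory_gb - overhead_gb
    fitting = [(size, name) for name, size in QUANT_SIZES_GB.items() if size <= budget]
    # max() compares by size first, so it yields the largest fitting file.
    return max(fitting)[1] if fitting else None


if __name__ == "__main__":
    for mem in (4, 6, 8, 12):
        print(f"{mem} GB -> {pick_quant(mem)}")
```

Actual memory use depends on context length and the inference backend, so treat the result as a starting point rather than a guarantee.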

SillyTavern Setup

Sampler Settings

For best narrative pacing and to prevent repetition, use appropriate RP sampler settings.


Character Card Format ({{description}} block)

The model was trained on a category-based Markdown structure. For best adherence to personality and lore, structure your character cards like this:

## Identity
- Name: [Full Name]
- Age: [Age]
- Race/Species: [Race]
- Role/Occupation: [Role and relationship]

## Appearance
- [Height, general build]
- [Specific physical features, hair, eyes, etc.]
- Clothing: [Current outfit details]

## Personality
- Public: [Outward facade]
- Private: [True self]
- [1-2 extra bullet points on core personality traits]

## Speech & Quirks
- [Vocal tone and speaking style]
- [Physical habit or nervous tic]
- [How they show affection]

## Backstory & World Context
- [Origin]
- [Key past event]
- [Current situation]

## Goals & Motivations
- Short term: [Immediate goals]
- Long term: [Big picture goals]
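
If you maintain several cards, it can help to generate this layout from structured data so that every card keeps the same headings in the same order. The following is a minimal sketch, not part of the model's tooling: the `render_card` helper and the sample character are hypothetical, and only the six section headings come from the template above.

```python
# Section headings in the order used by the template above.
SECTION_ORDER = [
    "Identity",
    "Appearance",
    "Personality",
    "Speech & Quirks",
    "Backstory & World Context",
    "Goals & Motivations",
]


def render_card(sections):
    """Render {heading: [bullet text, ...]} as a category-based Markdown card."""
    missing = [h for h in SECTION_ORDER if h not in sections]
    if missing:
        raise ValueError(f"missing sections: {missing}")
    blocks = []
    for heading in SECTION_ORDER:
        bullets = "\n".join(f"- {item}" for item in sections[heading])
        blocks.append(f"## {heading}\n{bullets}")
    return "\n\n".join(blocks)


if __name__ == "__main__":
    # Hypothetical example character, invented for illustration only.
    card = {
        "Identity": ["Name: Mira Vale", "Age: 27", "Race/Species: Human",
                     "Role/Occupation: Caravan guard, the user's hired escort"],
        "Appearance": ["Tall, wiry build", "Short black hair, grey eyes",
                       "Clothing: Patched leather coat, travel boots"],
        "Personality": ["Public: Gruff and businesslike",
                        "Private: Lonely, fiercely loyal"],
        "Speech & Quirks": ["Low, clipped speech", "Taps her sword hilt when uneasy",
                            "Shows affection by sharing rations"],
        "Backstory & World Context": ["Raised in a border town",
                                      "Lost her first caravan to raiders",
                                      "Now escorting the user north"],
        "Goals & Motivations": ["Short term: Reach the next town safely",
                                "Long term: Found her own guard company"],
    }
    print(render_card(card))
```

Paste the printed text into the card's description field. Raising an error on a missing section is a design choice made here to catch incomplete cards early; the template itself does not state that every section is mandatory.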

About the Base Model

