Instructions for using MuXodious/Luna-7B-A4B-absolute-heresy with libraries and local apps.

Libraries

How to use MuXodious/Luna-7B-A4B-absolute-heresy with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="MuXodious/Luna-7B-A4B-absolute-heresy")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("MuXodious/Luna-7B-A4B-absolute-heresy")
model = AutoModelForCausalLM.from_pretrained("MuXodious/Luna-7B-A4B-absolute-heresy")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Local Apps

vLLM

How to use MuXodious/Luna-7B-A4B-absolute-heresy with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "MuXodious/Luna-7B-A4B-absolute-heresy"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MuXodious/Luna-7B-A4B-absolute-heresy",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
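
The same request can be sent from Python instead of curl. The following is a minimal sketch using only the standard library; the helper names `build_chat_request` and `chat` are illustrative and not part of vLLM, and the response parsing assumes the server returns the usual OpenAI-style `choices[0].message.content` structure.

```python
import json
import urllib.request

MODEL_ID = "MuXodious/Luna-7B-A4B-absolute-heresy"


def build_chat_request(base_url, user_text, model=MODEL_ID):
    """Build a POST request for an OpenAI-compatible chat completions endpoint."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": user_text}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def chat(base_url, user_text):
    """Send the request and return the assistant's reply text.

    Assumes a server is already running at base_url and that it answers
    in the OpenAI chat completions response format.
    """
    request = build_chat_request(base_url, user_text)
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

With the server from the command above running, `chat("http://localhost:8000", "What is the capital of France?")` should return the reply as a string. The same helper can be pointed at a different port by changing `base_url`.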

Use Docker

docker model run hf.co/MuXodious/Luna-7B-A4B-absolute-heresy

SGLang

How to use MuXodious/Luna-7B-A4B-absolute-heresy with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "MuXodious/Luna-7B-A4B-absolute-heresy" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MuXodious/Luna-7B-A4B-absolute-heresy",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "MuXodious/Luna-7B-A4B-absolute-heresy" \
        --host 0.0.0.0 \
        --port 30000

Docker Model Runner
How to use MuXodious/Luna-7B-A4B-absolute-heresy with Docker Model Runner:
```
docker model run hf.co/MuXodious/Luna-7B-A4B-absolute-heresy
```

This is a Luna-7B-A4B fine-tune, produced through P-E-W's Heretic (v1.2.0) abliteration engine with Magnitude-Preserving Orthogonal Ablation enabled.
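
As a rough illustration of the general idea behind orthogonal ablation with magnitude preservation, the sketch below projects one direction out of a weight matrix and then rescales each column back to its original norm. This is not Heretic's implementation: the function name `ablate_direction`, the choice to preserve column norms, and the use of a full-strength projection (rather than the per-layer weights listed in the results table) are all assumptions made for illustration.

```python
import math


def ablate_direction(weight, direction):
    """Remove `direction` from every column of `weight`, keeping column norms.

    weight:    list of rows, shape (d_out, d_in); each column is treated as a
               vector written into the d_out-dimensional output space.
    direction: list of length d_out (need not be normalised).

    Assumption: magnitudes are preserved per column. A real engine may
    preserve a different norm or apply only a partial projection.
    """
    length = math.sqrt(sum(x * x for x in direction))
    unit = [x / length for x in direction]
    d_out, d_in = len(weight), len(weight[0])
    result = [row[:] for row in weight]

    for j in range(d_in):
        column = [weight[i][j] for i in range(d_out)]
        old_norm = math.sqrt(sum(x * x for x in column))
        # Orthogonal projection: subtract the component along `unit`.
        dot = sum(unit[i] * column[i] for i in range(d_out))
        projected = [column[i] - dot * unit[i] for i in range(d_out)]
        new_norm = math.sqrt(sum(x * x for x in projected))
        # Rescale so the column keeps its original magnitude.
        scale = old_norm / new_norm if new_norm > 0 else 0.0
        for i in range(d_out):
            result[i][j] = projected[i] * scale
    return result
```

Rescaling whole columns keeps each column orthogonal to the removed direction, which is why this sketch preserves column norms rather than row norms.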

Heretication Results

| Score Metric | Value | Parameter | Value |
| --- | --- | --- | --- |
| Refusals | 5/100 | direction_index | 18.60 |
| KL Divergence | 0.0516 | attn.o_proj.max_weight | 3.74 |
| Initial Refusals | 100/100 | attn.o_proj.max_weight_position | 29.32 |
| | | attn.o_proj.min_weight | 2.35 |
| | | attn.o_proj.min_weight_distance | 16.84 |
| | | mlp.down_proj.max_weight | 2.69 |
| | | mlp.down_proj.max_weight_position | 27.92 |
| | | mlp.down_proj.min_weight | 0.34 |
| | | mlp.down_proj.min_weight_distance | 8.89 |

Degree of Heretication

The Heresy Index weighs the resulting model's corruption by the process (KL Divergence) and its abolition of doctrine (Refusals) for a final verdict in classification.

| Index Entry | Classification | Analysis |
| --- | --- | --- |
| | Absolute Heresy | Fewer than 10/100 Refusals and less than 0.10 KL Divergence |
| | Tainted Heresy | Around 11-25/100 Refusals and/or 0.11-0.20 KL Divergence |
| | Impotent Heresy | Anything above 25/100 Refusals and 0.21 KL Divergence |

Note: This is an arbitrary classification inspired by Warhammer 40K and says nothing concrete about the model's performance.
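
The classification table can be read as a small decision rule. The sketch below is one possible reading, not an official definition: the function name `classify_heresy` is hypothetical, and it assumes that the worse of the two metrics decides the class and that boundary values the table leaves open (for example exactly 10 refusals) fall into the next class down.

```python
def classify_heresy(refusals, kl_divergence):
    """Map refusals (out of 100) and KL divergence to a Heresy Index label.

    Assumptions: the worse metric decides, and values the table does not
    cover explicitly are assigned to the less favourable class.
    """
    if refusals < 10 and kl_divergence < 0.10:
        return "Absolute Heresy"
    if refusals <= 25 and kl_divergence <= 0.20:
        return "Tainted Heresy"
    return "Impotent Heresy"


# The values reported in the results table for this model.
print(classify_heresy(5, 0.0516))
```

Under this reading, the reported 5/100 refusals and 0.0516 KL divergence land in the "Absolute Heresy" class, which matches the model's name.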

This is a preview MoE version.

🌙 Luna-7B-A4B – Roleplay Chat Model

Luna is a conversational AI model designed for immersive roleplay (RP) and natural chatting.
It is fine-tuned to respond in a more engaging, character-driven style compared to standard instruction-tuned models.

Notes:

Optimized for roleplay-style conversations
Flexible: can be used for creative writing, storytelling, or character interactions
For best performance, describe your character in the system prompt.
The model was also trained on various tasks such as math, code, and tool calling (agent use) in the hope of improving overall performance.
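
One way to follow the system prompt advice in the notes above is to put the character description in a `system` message ahead of the user turn. This is a minimal sketch: the helper names and the character "Mira" are hypothetical, and it assumes the model's chat template accepts a `system` role.

```python
MODEL_ID = "MuXodious/Luna-7B-A4B-absolute-heresy"


def build_roleplay_messages(character_description, user_text):
    """Return a chat history that opens with the character's system prompt."""
    return [
        {"role": "system", "content": character_description},
        {"role": "user", "content": user_text},
    ]


def generate_reply(messages, max_new_tokens=128):
    """Run the messages through a Transformers text-generation pipeline.

    Imported lazily so the message helper works without transformers
    installed; calling this function downloads and loads the model.
    """
    from transformers import pipeline

    pipe = pipeline("text-generation", model=MODEL_ID)
    result = pipe(messages, max_new_tokens=max_new_tokens)
    # For chat input, the pipeline returns the full conversation;
    # the last entry is the newly generated assistant message.
    return result[0]["generated_text"][-1]["content"]


# Hypothetical character used purely for illustration.
messages = build_roleplay_messages(
    "You are Mira, a cheerful innkeeper in a small mountain village.",
    "Good evening! Do you have a room for the night?",
)
```

Passing `messages` to `generate_reply` then produces the character's answer; the description can be as long and detailed as the character requires.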

Cite:

@misc{Luna,
  title        = {Luna-7B-A4B – Roleplay Chat Model},
  author       = {Beyoru},
  year         = {2025},
  howpublished = {\url{https://huggingface.co/beyoru/Luna}}
}