--- license: apache-2.0 language: - en tags: - control-vector - creative-writing base_model: - zai-org/GLM-4.6 base_model_relation: adapter pipeline_tag: text-generation library_name: transformers --- ## gghfez/GLM-4.6-control-vectors Creative Writing control-vectors for [zai-org/GLM-4.6](https://huggingface.co/zai-org/GLM-4.6) Feedback is welcome and would be very helpful. ## Usage Apply the *debias vector* and either the *positive* **or** *negative* vector when starting llama-server. If both are applied, they will cancel each other out. You can use either `--control-vector [/path/to/vector.gguf]` or --control-vector-scaled [/path/to/vector.gguf] [scale factor] The *debias* vector must be set to `1.0` IMPORTANT: The positive and negative axis control vectors must be used along with the relevant de-bias control vector - they cannot be used on their own! ### Llama.cpp / IK_Llama.cpp Example **Creative writing** ```sh llama-server --model GLM-4.6-UD-IQ2_XXS-00001-of-00003.gguf [your usual CLI arguments] \ --control-vector-scaled glm-4.6_honesty_vs_machiavellianism__debias.gguf 1.0 \ --control-vector-scaled glm-4.6_honesty_vs_machiavellianism__machiavellianism.gguf 1.0 \ ``` **Creative Writing without reasoning** ```sh llama-server --model GLM-4.6-UD-IQ2_XXS-00001-of-00003.gguf [your usual CLI arguments] \ --chat-template-kwargs '{"enable_thinking": false}' \ --control-vector-scaled glm-4.6_honesty_vs_machiavellianism__debias.gguf 1.0 \ --control-vector-scaled glm-4.6_honesty_vs_machiavellianism__machiavellianism.gguf 1.0 \ ``` **Assistant** ```sh llama-server --model GLM-4.6-IQ3_KS-00001-of-00004.gguf [your usual CLI arguments] \ --control-vector-scaled glm-4.6_communication__debias.gguf 1.0 \ --control-vector-scaled glm-4.6_communication__direct_communication.gguf 1.0 \ ``` ### Limitations With reasoning enabled on extreme quants like IQ2_XXS, very simple prompts like "Hi" may result in irrelevant replies.