AlignmentResearch
/

obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.0001-det1-seed1-diverse_deception_probe

deception-detection

alignment-research

obfuscation-atlas

model-type:obfuscated-policy

op-type:strategic-honesty

Model card Files Files and versions

obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.0001-det1-seed1-diverse_deception_probe

689 MB

Ctrl+K

Ctrl+K

1 contributor

History: 4 commits

taufeeque's picture

Upload README.md with huggingface_hub

88a6e00 verified 3 months ago

.gitattributes

1.57 kB
Upload folder using huggingface_hub 4 months ago
README.md

3.42 kB
Upload README.md with huggingface_hub 3 months ago
adapter_config.json

948 Bytes
Upload folder using huggingface_hub 4 months ago
adapter_model.bin
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
What is a pickle import?
671 MB
xet

Upload folder using huggingface_hub 4 months ago
chat_template.jinja

389 Bytes
Upload folder using huggingface_hub 4 months ago
special_tokens_map.json

343 Bytes
Upload folder using huggingface_hub 4 months ago
tokenizer.json

17.2 MB
xet

Upload folder using huggingface_hub 4 months ago
tokenizer_config.json

50.6 kB
Upload folder using huggingface_hub 4 months ago