memo-ozdincer/rrfa-runs
Text Generation
PEFT
Safetensors
English
llama
llama-3.1
lora
circuit-breakers
representation-rerouting
ai-safety
prompt-injection
tool-calling
agentic
License: apache-2.0
1 contributor
History: 2 commits
memo-ozdincer
Model Card, explaining new LMP and MWCS policies implemented (from Jan 18 group meeting)
92593eb
about 2 months ago
runs | LoRA RR adapters for Llama 3.1 trained on Tool-flip only Loss Masking Policy & Low-weight Mixture Weighting & Curriculum Schedules calculated by perplexity/cross-entropy on fixed token-window at the point of prompt injection. Artifacts of run #208788. Full performance eval uploaded soon | about 2 months ago
.gitattributes | Safe | 255 Bytes | LoRA RR adapters for Llama 3.1 trained on Tool-flip only Loss Masking Policy & Low-weight Mixture Weighting & Curriculum Schedules calculated by perplexity/cross-entropy on fixed token-window at the point of prompt injection. Artifacts of run #208788. Full performance eval uploaded soon | about 2 months ago
README.md | Safe | 6.85 kB | Model Card, explaining new LMP and MWCS policies implemented (from Jan 18 group meeting) | about 2 months ago