PEFT
Safetensors
English
cybersecurity
malware-analysis
att&ck
threat-intelligence
mixtral
lora
expert-adapters
cape-sandbox
digital-forensics
Instructions to use umer07/fathom-mixtral with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use umer07/fathom-mixtral with PEFT:
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the base model, then attach the FATHOM LoRA adapter on top of it.
base_model = AutoModelForCausalLM.from_pretrained("mistralai/Mixtral-8x7B-Instruct-v0.1")
model = PeftModel.from_pretrained(base_model, "umer07/fathom-mixtral")
```

A minimal generation sketch follows the notebook links below.

- Notebooks
- Google Colab
- Kaggle
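Once the adapter is attached, the model generates like any other `transformers` causal LM. A minimal sketch continuing from the PEFT snippet above; the prompt and decoding settings are illustrative, not from the model card:

```python
import torch
from transformers import AutoTokenizer

# The tokenizer comes from the base model; the LoRA adapter does not change it.
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mixtral-8x7B-Instruct-v0.1")

# Mixtral-Instruct uses the [INST] ... [/INST] chat format.
prompt = "[INST] Map this sandbox behavior to ATT&CK: process hollowing of svchost.exe. [/INST]"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```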
## Model Overview

- **Base:** Mixtral-8x7B-Instruct-v0.1 (full bf16, no quantization)
- **Training:** Direct PEFT+TRL
- **Adapters:** 1 unified + 9 expert LoRA adapters (all rank=32, α=16); see the loading sketch below
- **Hardware:** AMD MI300X (205.8 GB VRAM) — full bf16 training
- **Key Innovation:** Evidence extraction layer + structured behavioral prompts → **9× improvement** in real ATT&CK mapping
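The card describes one unified adapter plus nine expert adapters. If the experts ship as separate adapter folders in the repo, PEFT can hold several of them on one base model and switch between them per request. A minimal sketch, assuming a hypothetical subfolder layout (`experts/attck-mapping` is illustrative, not confirmed by the card):

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Full bf16, no quantization, matching the training setup stated above.
base_model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mixtral-8x7B-Instruct-v0.1", torch_dtype=torch.bfloat16
)

# Attach the unified adapter under an explicit name.
model = PeftModel.from_pretrained(
    base_model, "umer07/fathom-mixtral", adapter_name="unified"
)

# Register an expert adapter alongside it (hypothetical subfolder name).
model.load_adapter(
    "umer07/fathom-mixtral",
    adapter_name="attck_expert",
    subfolder="experts/attck-mapping",
)

model.set_adapter("attck_expert")  # route generation through the expert
model.set_adapter("unified")       # or fall back to the unified adapter
```

For reference, the stated LoRA hyperparameters map to a PEFT config along these lines:

```python
from peft import LoraConfig

# rank=32, alpha=16, as listed in the Model Overview.
lora_config = LoraConfig(r=32, lora_alpha=16, task_type="CAUSAL_LM")
```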