Schrieffer/Llama-SARM-4B
Tags: Reinforcement Learning, Transformers, Safetensors, llama, text-classification, reward-model, rlhf, sparse-autoencoder, interpretability, custom_code, text-generation-inference
arxiv: 2508.08746
License: llama3.1
refs/pr/1
Llama-SARM-4B, 9.11 GB, 4 contributors
History: 14 commits
Latest commit: nielsr (HF Staff), "Add library_name and pipeline_tag to metadata", 2b05504 (verified), 3 months ago
File | Size | Last commit | Updated
.gitattributes | 1.52 kB | initial commit | 5 months ago
README.md | 2.39 kB | Add library_name and pipeline_tag to metadata | 3 months ago
config.json | 1.25 kB | add remote code | 4 months ago
model-00001-of-00002.safetensors | 4.98 GB (xet) | Initial commit of the reward model | 5 months ago
model-00002-of-00002.safetensors | 4.13 GB (xet) | Initial commit of the reward model | 5 months ago
model.safetensors.index.json | 12.3 kB | Initial commit of the reward model | 5 months ago
modeling_sarm_gemma2.py | 20 kB | add remote code | 4 months ago
modeling_sarm_llama.py | 23.7 kB | add remote code | 4 months ago
special_tokens_map.json | 434 Bytes | Add tokenizer | 5 months ago
tokenizer.json | 9.09 MB | Add tokenizer | 5 months ago
tokenizer_config.json | 55.6 kB | add remote code | 4 months ago