THU-KEG/WildReward-4B
Knowledge Engineer Group @ Tsinghua University
Tags: Text Classification, Transformers, Safetensors, qwen3, reward-model, rlhf, dpo, alignment, wildchat, text-embeddings-inference
License: apache-2.0
main
WildReward-4B
Commit History
Update README.md (ec67a1c, verified): Wesleythu committed 4 days ago
Update README.md (261701e, verified): Wesleythu committed 4 days ago
Create README.md (8545052, verified): Wesleythu committed 4 days ago
Upload vocab.json (74ec0c9, verified): Wesleythu committed on Dec 28, 2025
Upload training_args.bin (acc2c5f, verified): Wesleythu committed on Dec 28, 2025
Upload tokenizer_config.json (d4cfef0, verified): Wesleythu committed on Dec 28, 2025
Upload tokenizer.json (6539328, verified): Wesleythu committed on Dec 28, 2025
Upload special_tokens_map.json (44adc1b, verified): Wesleythu committed on Dec 28, 2025
Upload model.safetensors.index.json (5f2b86c, verified): Wesleythu committed on Dec 28, 2025
Upload model-00002-of-00002.safetensors (2e463c4, verified): Wesleythu committed on Dec 28, 2025
Upload model-00001-of-00002.safetensors (23a7b51, verified): Wesleythu committed on Dec 28, 2025
Upload merges.txt (acbb458, verified): Wesleythu committed on Dec 28, 2025
Upload config.json (93bf6eb, verified): Wesleythu committed on Dec 28, 2025
Upload chat_template.jinja (aaaa405, verified): Wesleythu committed on Dec 28, 2025
Upload added_tokens.json (e352bc6, verified): Wesleythu committed on Dec 28, 2025
initial commit (beb4297, verified): Wesleythu committed on Dec 23, 2025