ziadrone/airesupdated-v6
Text Generation
PEFT
Safetensors
English
reasoning
tree-of-thoughts
dpo
grpo
mathematics
rlhf
lora
conversational
License: apache-2.0
airesupdated-v6/tokenizer.json (main)
Commit History
Upload ToT-GRPO adapter
57ef86f
verified
ziadrone
committed on
Nov 5, 2025