Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
enzii
/
agent-2048-007
like
0
Text Generation
PEFT
Safetensors
Transformers
grpo
lora
trl
unsloth
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Use this model
main
agent-2048-007
Commit History
Upload README.md with huggingface_hub
c76e4b0
verified
enzii
commited on
Aug 2, 2025
Upload training_args.bin with huggingface_hub
8791fdd
verified
enzii
commited on
Aug 2, 2025
Upload special_tokens_map.json with huggingface_hub
c6fc121
verified
enzii
commited on
Aug 2, 2025
Upload added_tokens.json with huggingface_hub
c4679a5
verified
enzii
commited on
Aug 2, 2025
Upload vocab.json with huggingface_hub
e983559
verified
enzii
commited on
Aug 2, 2025
Upload tokenizer.json with huggingface_hub
da37533
verified
enzii
commited on
Aug 2, 2025
Upload tokenizer_config.json with huggingface_hub
4b5a76d
verified
enzii
commited on
Aug 2, 2025
Upload adapter_model.safetensors with huggingface_hub
ef2b772
verified
enzii
commited on
Aug 2, 2025
Upload adapter_config.json with huggingface_hub
7d9676f
verified
enzii
commited on
Aug 2, 2025
initial commit
9152519
verified
enzii
commited on
Aug 2, 2025