Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
diagonalge
/
texttestgrpo
like
0
Text Generation
Transformers
Safetensors
llama
Generated from Trainer
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
texttestgrpo
/
checkpoint-1
51.6 MB
1 contributor
History:
1 commit
diagonalge
Upload task output testgrpo
4b3c951
verified
6 months ago
README.md
5.13 kB
Upload task output testgrpo
6 months ago
adapter_config.json
891 Bytes
Upload task output testgrpo
6 months ago
adapter_model.safetensors
22.6 MB
xet
Upload task output testgrpo
6 months ago
chat_template.jinja
485 Bytes
Upload task output testgrpo
6 months ago
optimizer.pt
11.7 MB
xet
Upload task output testgrpo
6 months ago
rng_state.pth
14.2 kB
xet
Upload task output testgrpo
6 months ago
scheduler.pt
1.06 kB
xet
Upload task output testgrpo
6 months ago
special_tokens_map.json
459 Bytes
Upload task output testgrpo
6 months ago
tokenizer.json
17.2 MB
xet
Upload task output testgrpo
6 months ago
tokenizer_config.json
50.6 kB
Upload task output testgrpo
6 months ago
trainer_state.json
1.68 kB
Upload task output testgrpo
6 months ago
training_args.bin
7.86 kB
xet
Upload task output testgrpo
6 months ago