wsabreu/Tupi-Think-2b4
Tags: Text Generation, PEFT, Safetensors, Transformers, grpo, lora, trl, unsloth, conversational, arxiv:1910.09700
Branch: main
Total size: 258 MB
1 contributor
History: 3 commits
Latest commit: wsabreu, "Upload folder using huggingface_hub", 5a5ee1c (verified), 30 days ago
| File | Size | Storage | Last commit message | Updated |
|---|---|---|---|---|
| .gitattributes | 1.52 kB | | initial commit | 30 days ago |
| README.md | 5.22 kB | | Upload folder using huggingface_hub | 30 days ago |
| adapter_config.json | 1.07 kB | | Upload folder using huggingface_hub | 30 days ago |
| adapter_model.safetensors | 169 MB | xet | Upload LlamaForCausalLM | 30 days ago |
| chat_template.jinja | 353 Bytes | | Upload folder using huggingface_hub | 30 days ago |
| generation_config.json | 240 Bytes | | Upload LlamaForCausalLM | 30 days ago |
| optimizer.pt | 86.3 MB | xet | Upload folder using huggingface_hub | 30 days ago |
| rng_state.pth | 14.6 kB | xet | Upload folder using huggingface_hub | 30 days ago |
| scheduler.pt | 1.47 kB | xet | Upload folder using huggingface_hub | 30 days ago |
| special_tokens_map.json | 608 Bytes | | Upload folder using huggingface_hub | 30 days ago |
| tokenizer.json | 2.27 MB | | Upload folder using huggingface_hub | 30 days ago |
| tokenizer_config.json | 3.3 kB | | Upload folder using huggingface_hub | 30 days ago |
| trainer_state.json | 96.3 kB | | Upload folder using huggingface_hub | 30 days ago |
| training_args.bin | 7.31 kB | xet | Upload folder using huggingface_hub | 30 days ago |