Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
bimabk
/
environment_test_affine_qwen_7b_instruct
like
0
Text Generation
PEFT
Safetensors
Transformers
qwen2
grpo
lora
trl
conversational
text-generation-inference
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
environment_test_affine_qwen_7b_instruct
16.5 GB
1 contributor
History:
6 commits
This model has 1 file scanned as suspicious.
Show
files
bimabk
Upload task output 1
37821fe
verified
21 days ago
.gitattributes
1.57 kB
Upload task output 1
21 days ago
README.md
5.21 kB
Upload task output 1
21 days ago
adapter_config.json
1.04 kB
Upload task output 1
21 days ago
adapter_model.safetensors
1.29 GB
xet
Upload task output 1
21 days ago
chat_template.jinja
328 Bytes
Upload task output 1
21 days ago
config.json
1.37 kB
Upload task output 1
21 days ago
generation_config.json
38 Bytes
Upload task output 1
21 days ago
loss.txt
23 Bytes
Upload task output 1
21 days ago
model.safetensors
15.2 GB
xet
Upload task output 1
21 days ago
tokenizer.json
11.4 MB
xet
Upload task output 1
21 days ago
tokenizer_config.json
418 Bytes
Upload task output 1
21 days ago
trainer_state.json
47.1 kB
Upload task output 1
21 days ago
training_args.bin
7.19 kB
xet
Upload task output 1
21 days ago