Hugging Face
andrewlngdn/dsl-debug-7b-sft-rl
Tags: Text Generation, Safetensors, custom, English, qwen2, reinforcement-learning, grpo, tool-use, debugging, dsl, conversational, Eval Results (legacy)
License: mit
Branch: main
Path: dsl-debug-7b-sft-rl/global_step_35 (45.7 GB)
1 contributor
History: 1 commit
Latest commit: ff325a6 (verified), andrewlngdn, "Upload folder using huggingface_hub", about 1 month ago
Files:
actor    Upload folder using huggingface_hub    about 1 month ago
data.pt    7.32 kB (xet)    Upload folder using huggingface_hub    about 1 month ago
    Pickle scan: Safe
    Detected Pickle imports (3): "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "torch.ByteStorage"
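The "Detected Pickle imports" entry for data.pt lists the importable names that the pickle stream references. As a rough illustration of how such a list can be derived without unpickling anything, the sketch below walks the opcodes with the standard-library `pickletools` module. It is not the Hub's scanner: `list_pickle_imports` is a hypothetical helper, the sample bytes are generated in memory rather than read from data.pt, and the `STACK_GLOBAL` handling is a simple heuristic that assumes the module and name strings were pushed immediately beforehand.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set[str]:
    """Collect 'module.name' references from a pickle stream without executing it.

    Hypothetical helper for illustration only; not the Hugging Face scanner.
    """
    imports: set[str] = set()
    recent_strings: list[str] = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name" joined by a space.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are the two most recently pushed strings
            # (heuristic; memoized strings fetched via BINGET are not handled).
            if len(recent_strings) >= 2:
                imports.add(f"{recent_strings[-2]}.{recent_strings[-1]}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return imports


# Sample stream built in memory (an assumption for the demo, not the real file).
sample = pickle.dumps(collections.OrderedDict(a=1), protocol=2, fix_imports=False)
print(sorted(list_pickle_imports(sample)))
```

If a checkpoint file turns out to be a zip archive rather than a bare pickle stream, the pickle bytes would first have to be extracted from the archive before being passed to a helper like this.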