Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Lansechen
/
deepseek-v2-lite-16b-chat-R1-Distill-batch16-lora-numinamath
like
1
Text Generation
Transformers
Safetensors
AI-MO/NuminaMath-TIR
deepseek_v2
Generated from Trainer
open-r1
trl
sft
conversational
custom_code
text-generation-inference
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
deepseek-v2-lite-16b-chat-R1-Distill-batch16-lora-numinamath
12 MB
1 contributor
History:
3 commits
Lansechen
End of training
058917d
verified
about 1 year ago
.gitattributes
1.52 kB
initial commit
about 1 year ago
README.md
2.02 kB
End of training
about 1 year ago
adapter_config.json
744 Bytes
Model save
about 1 year ago
adapter_model.safetensors
4.43 MB
xet
Model save
about 1 year ago
all_results.json
213 Bytes
Model save
about 1 year ago
config.json
1.74 kB
End of training
about 1 year ago
special_tokens_map.json
369 Bytes
Model save
about 1 year ago
tokenizer.json
7.5 MB
Model save
about 1 year ago
tokenizer_config.json
1.37 kB
Model save
about 1 year ago
train_results.json
213 Bytes
Model save
about 1 year ago
trainer_state.json
20 kB
Model save
about 1 year ago
training_args.bin
7.54 kB
xet
Model save
about 1 year ago