rkumar1999/Llama-3.1-8B-Instruct-Open-R1-Distill
Text Generation
Transformers
Safetensors
rkumar1999/numina-deepseek-r1-qwen-7b
llama
Generated from Trainer
open-r1
conversational
text-generation-inference
Branch: main
Path: Llama-3.1-8B-Instruct-Open-R1-Distill/checkpoint-8
Total size: 70.4 MB
1 contributor, History: 1 commit
Latest commit: rkumar1999, "Upload trained model" (ac8ba96, verified), 12 months ago
Name                        Size        Storage  Last commit            Updated
global_step8                -           -        Upload trained model   12 months ago
README.md                   5.11 kB     -        Upload trained model   12 months ago
adapter_config.json         754 Bytes   -        Upload trained model   12 months ago
adapter_model.safetensors   7.52 MB     xet      Upload trained model   12 months ago
latest                      12 Bytes    -        Upload trained model   12 months ago
rng_state_0.pth             14.5 kB     xet      Upload trained model   12 months ago
rng_state_1.pth             14.5 kB     xet      Upload trained model   12 months ago
scheduler.pt                1.06 kB     xet      Upload trained model   12 months ago
special_tokens_map.json     325 Bytes   -        Upload trained model   12 months ago
tokenizer.json              17.2 MB     xet      Upload trained model   12 months ago
tokenizer_config.json       55.4 kB     -        Upload trained model   12 months ago
trainer_state.json          749 Bytes   -        Upload trained model   12 months ago
training_args.bin           7.35 kB     xet      Upload trained model   12 months ago
zero_to_fp32.py             29.2 kB     -        Upload trained model   12 months ago