p1atdev/llm-jp-3-3.7b-instruct2-R27
Text Generation
Transformers
Safetensors
p1atdev/gsm8k-ja-slim
SyntheticVeryEasyMath5k
SyntheticWhichIsGreater5k
Japanese
llama
grpo
trl
conversational
text-generation-inference
License: apache-2.0
main
llm-jp-3-3.7b-instruct2-R27
Commit History
Training in progress, step 240 | 385aa18 | verified | p1atdev committed on Feb 13, 2025
Training in progress, step 220 | 3513951 | verified | p1atdev committed on Feb 13, 2025
Training in progress, step 200 | e4a7a05 | verified | p1atdev committed on Feb 13, 2025
Training in progress, step 160 | d8b5c4a | verified | p1atdev committed on Feb 13, 2025
Training in progress, step 140 | 5e0e422 | verified | p1atdev committed on Feb 13, 2025
Training in progress, step 120 | 0ca8957 | verified | p1atdev committed on Feb 13, 2025
Training in progress, step 100 | 64c1178 | verified | p1atdev committed on Feb 13, 2025
Training in progress, step 80 | 2e3418d | verified | p1atdev committed on Feb 13, 2025
Training in progress, step 40 | d3a0304 | verified | p1atdev committed on Feb 13, 2025
Training in progress, step 20 | 3246e54 | verified | p1atdev committed on Feb 13, 2025
initial commit | d4fa9a2 | verified | p1atdev committed on Feb 13, 2025