Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
350.2
TFLOPS
7
Yoshinobu Ohtani
y-ohtani
Follow
dark-pen's profile picture
shtefcs's profile picture
yustudiojp's profile picture
3 followers
·
2 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 1 month ago
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch3
published
a model
about 1 month ago
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch3
updated
a model
about 1 month ago
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch1
View all activity
Organizations
None yet
models
24
Sort: Recently updated
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch3
Reinforcement Learning
•
4B
•
Updated
Mar 7
•
23
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch2
Reinforcement Learning
•
4B
•
Updated
Mar 7
•
23
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch1
Reinforcement Learning
•
4B
•
Updated
Mar 7
•
24
y-ohtani/qwen3-4b-agent-sft-true
Text Generation
•
4B
•
Updated
Mar 2
•
40
•
1
y-ohtani/GRPO-TCR-Qwen3-4B-test
Text Generation
•
4B
•
Updated
Mar 2
•
31
y-ohtani/GRPO-TCR-Qwen3-4B-quarter
Text Generation
•
4B
•
Updated
Mar 1
•
12
y-ohtani/qwen3-4b-grpo-tcr-agent
Text Generation
•
4B
•
Updated
Mar 1
•
19
•
2
y-ohtani/Qwen3-4B-Instruct-2507_16-1_global_step_744
Text Generation
•
4B
•
Updated
Feb 27
•
2
y-ohtani/qwen3-4b-ra-sft-merged
Text Generation
•
4B
•
Updated
Feb 27
•
3
y-ohtani/qwen3-4b-ra-sft-epoch5
Text Generation
•
4B
•
Updated
Feb 27
•
2
View 24 models
datasets
2
Sort: Recently updated
y-ohtani/open_agentrl_grpo_2k
Viewer
•
Updated
Feb 28
•
2k
•
63
•
1
y-ohtani/open_agentrl_like_sft
Viewer
•
Updated
Feb 13
•
2k
•
6