Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
350.2
TFLOPS
7
Yoshinobu Ohtani
y-ohtani
Follow
shtefcs's profile picture
yustudiojp's profile picture
dark-pen's profile picture
3 followers
·
2 following
AI & ML interests
None yet
Organizations
None yet
models
24
Sort: Recently updated
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch3
Reinforcement Learning
•
4B
•
Updated
Mar 7
•
7
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch2
Reinforcement Learning
•
4B
•
Updated
Mar 7
•
3
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch1
Reinforcement Learning
•
4B
•
Updated
Mar 7
•
4
y-ohtani/qwen3-4b-agent-sft-true
Text Generation
•
4B
•
Updated
Mar 2
•
15
•
•
1
y-ohtani/GRPO-TCR-Qwen3-4B-test
Text Generation
•
4B
•
Updated
Mar 2
•
5
•
y-ohtani/GRPO-TCR-Qwen3-4B-quarter
Text Generation
•
4B
•
Updated
Mar 1
•
3
y-ohtani/qwen3-4b-grpo-tcr-agent
Text Generation
•
4B
•
Updated
Mar 1
•
4
•
2
y-ohtani/Qwen3-4B-Instruct-2507_16-1_global_step_744
Text Generation
•
4B
•
Updated
Feb 27
•
1
y-ohtani/qwen3-4b-ra-sft-merged
Text Generation
•
4B
•
Updated
Feb 27
•
2
y-ohtani/qwen3-4b-ra-sft-epoch5
Text Generation
•
4B
•
Updated
Feb 27
•
5
View 24 models
datasets
2
Sort: Recently updated
y-ohtani/open_agentrl_grpo_2k
Viewer
•
Updated
Feb 28
•
2k
•
41
•
1
y-ohtani/open_agentrl_like_sft
Viewer
•
Updated
Feb 13
•
2k
•
27