Hugging Face
Yoshinobu Ohtani
y-ohtani
2 followers · 2 following
AI & ML interests: None yet
Recent Activity
Updated a model 4 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch3
Published a model 4 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch3
Updated a model 4 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch2
Organizations: None yet
Models: 24 (sorted by most recently updated)
y-ohtani/GRPO-TCR-Qwen3-4B-epoch3 • Reinforcement Learning • 4B • Updated 4 days ago • 34
y-ohtani/GRPO-TCR-Qwen3-4B-epoch2 • Reinforcement Learning • 4B • Updated 4 days ago • 22
y-ohtani/GRPO-TCR-Qwen3-4B-epoch1 • Reinforcement Learning • Updated 4 days ago • 29
y-ohtani/GRPO-TCR-Qwen3-4B • 4B • Updated 7 days ago • 14
y-ohtani/GRPO-TCR-Qwen3-4B-step400 • 4B • Updated 7 days ago • 11
y-ohtani/qwen3-4b-ra-sft-epoch5 • 4B • Updated 7 days ago • 17
y-ohtani/qwen3-4b-ra-sft-epoch4 • 4B • Updated 7 days ago • 13
y-ohtani/qwen3-4b-ra-sft-epoch3 • 4B • Updated 7 days ago • 18
y-ohtani/Qwen3-4B-Instruct-2507_SFT2 • Text Generation • 4B • Updated 8 days ago
y-ohtani/qwen3-4b-ra-sft-merged • Reinforcement Learning • 4B • Updated 8 days ago • 17
Datasets: 1
y-ohtani/open_agentrl_like_sft • Viewer • Updated 13 days ago • 2k • 10