Yoshinobu Ohtani
y-ohtani
2 followers · 2 following
AI & ML interests
None yet
Recent Activity
updated a model 5 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch3
published a model 5 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch3
updated a model 5 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch2
Organizations
None yet
y-ohtani's activity
updated a model 5 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch3 (Reinforcement Learning • 4B • Updated 5 days ago • 34)
published a model 5 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch3 (Reinforcement Learning • 4B • Updated 5 days ago • 34)
updated a model 5 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch2 (Reinforcement Learning • 4B • Updated 5 days ago • 22)
published a model 5 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch2 (Reinforcement Learning • 4B • Updated 5 days ago • 22)
updated a model 5 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch1 (Reinforcement Learning • Updated 5 days ago • 29)
published a model 5 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-epoch1 (Reinforcement Learning • Updated 5 days ago • 29)
published a dataset 5 days ago: y-ohtani/open_agentrl_like_sft (Viewer • Updated 13 days ago • 2k • 10)
updated a model 7 days ago: y-ohtani/GRPO-TCR-Qwen3-4B (4B • Updated 7 days ago • 14)
published a model 7 days ago: y-ohtani/GRPO-TCR-Qwen3-4B (4B • Updated 7 days ago • 14)
updated a model 7 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-step400 (4B • Updated 7 days ago • 11)
published a model 7 days ago: y-ohtani/GRPO-TCR-Qwen3-4B-step400 (4B • Updated 7 days ago • 11)
updated a model 7 days ago: y-ohtani/qwen3-4b-ra-sft-epoch5 (4B • Updated 7 days ago • 17)
published a model 7 days ago: y-ohtani/qwen3-4b-ra-sft-epoch5 (4B • Updated 7 days ago • 17)
updated a model 7 days ago: y-ohtani/qwen3-4b-ra-sft-epoch4 (4B • Updated 7 days ago • 13)
published a model 7 days ago: y-ohtani/qwen3-4b-ra-sft-epoch4 (4B • Updated 7 days ago • 13)
updated a model 7 days ago: y-ohtani/qwen3-4b-ra-sft-epoch3 (4B • Updated 7 days ago • 18)
published a model 7 days ago: y-ohtani/qwen3-4b-ra-sft-epoch3 (4B • Updated 7 days ago • 18)
updated a model 8 days ago: y-ohtani/Qwen3-4B-Instruct-2507_SFT2 (Text Generation • 4B • Updated 8 days ago)
published 2 models 8 days ago:
y-ohtani/Qwen3-4B-Instruct-2507_SFT2 (Text Generation • 4B • Updated 8 days ago)
y-ohtani/qwen3-4b-ra-sft-merged (Reinforcement Learning • 4B • Updated 8 days ago • 17)