Hugging Face
Yoshinobu Ohtani
y-ohtani
3 followers · 2 following
AI & ML interests
None yet
Organizations
None yet
y-ohtani's activity
updated a model, about 1 month ago
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch3 • Reinforcement Learning • 4B • Updated Mar 7 • 23

published a model, about 1 month ago
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch3 • Reinforcement Learning • 4B • Updated Mar 7 • 23

updated 2 models, about 1 month ago
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch1 • Reinforcement Learning • 4B • Updated Mar 7 • 24
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch2 • Reinforcement Learning • 4B • Updated Mar 7 • 23

published 2 models, about 1 month ago
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch2 • Reinforcement Learning • 4B • Updated Mar 7 • 23
y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch1 • Reinforcement Learning • 4B • Updated Mar 7 • 24

updated a model, about 1 month ago
y-ohtani/qwen3-4b-agent-sft-true • Text Generation • 4B • Updated Mar 2 • 40 • 1

published a model, about 1 month ago
y-ohtani/qwen3-4b-agent-sft-true • Text Generation • 4B • Updated Mar 2 • 40 • 1

updated 2 models, about 1 month ago
y-ohtani/GRPO-TCR-Qwen3-4B-test • Text Generation • 4B • Updated Mar 2 • 31
y-ohtani/qwen3-4b-grpo-tcr-agent • Text Generation • 4B • Updated Mar 1 • 19 • 2

published a model, about 1 month ago
y-ohtani/qwen3-4b-grpo-tcr-agent • Text Generation • 4B • Updated Mar 1 • 19 • 2

updated a model, about 1 month ago
y-ohtani/GRPO-TCR-Qwen3-4B-quarter • Text Generation • 4B • Updated Mar 1 • 12

published a model, about 1 month ago
y-ohtani/GRPO-TCR-Qwen3-4B-quarter • Text Generation • 4B • Updated Mar 1 • 12

published a model, about 2 months ago
y-ohtani/GRPO-TCR-Qwen3-4B-test • Text Generation • 4B • Updated Mar 2 • 31

updated a dataset, about 2 months ago
y-ohtani/open_agentrl_grpo_2k • Viewer • Updated Feb 28 • 2k • 63 • 1

published a dataset, about 2 months ago
y-ohtani/open_agentrl_grpo_2k • Viewer • Updated Feb 28 • 2k • 63 • 1

updated 4 models, about 2 months ago
y-ohtani/Qwen3-4B-Instruct-2507_16-1_global_step_744 • Text Generation • 4B • Updated Feb 27 • 2
y-ohtani/qwen3-4b-ra-sft-merged • Text Generation • 4B • Updated Feb 27 • 3
y-ohtani/qwen3-4b-ra-sft-epoch5 • Text Generation • 4B • Updated Feb 27 • 2
y-ohtani/qwen3-4b-ra-sft-epoch4 • Text Generation • 4B • Updated Feb 27 • 1