Yoshinobu Ohtani

y-ohtani

7

·

AI & ML interests

None yet

Recent Activity

updated a model 4 days ago

y-ohtani/qwen36_lora

published a model 4 days ago

y-ohtani/qwen36_lora

updated a model 5 months ago

y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch3

View all activity

Organizations

None yet

models 25

y-ohtani/qwen36_lora

Updated 4 days ago

y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch3

Reinforcement Learning • 4B • Updated Mar 7 • 2

y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch2

Reinforcement Learning • 4B • Updated Mar 7 • 4

y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch1

Reinforcement Learning • 4B • Updated Mar 7 • 3

y-ohtani/qwen3-4b-agent-sft-true

Text Generation • 4B • Updated Mar 2 • 19 • • 1

y-ohtani/GRPO-TCR-Qwen3-4B-test

Text Generation • 4B • Updated Mar 2 • 6 •

y-ohtani/GRPO-TCR-Qwen3-4B-quarter

Text Generation • 4B • Updated Mar 1 • 7

y-ohtani/qwen3-4b-grpo-tcr-agent

Text Generation • 4B • Updated Mar 1 • 4 • 2

y-ohtani/Qwen3-4B-Instruct-2507_16-1_global_step_744

Text Generation • 4B • Updated Feb 27 • 5

y-ohtani/qwen3-4b-ra-sft-merged

Text Generation • 4B • Updated Feb 27 • 4

datasets 2

y-ohtani/open_agentrl_grpo_2k

Viewer • Updated Feb 28 • 2k • 20 • 1

y-ohtani/open_agentrl_like_sft

Viewer • Updated Feb 13 • 2k • 11