Yoshinobu Ohtani

y-ohtani

AI & ML interests

None yet

Organizations

None yet

models 24

y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch3

Reinforcement Learning • 4B • Updated Mar 7 • 7

y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch2

Reinforcement Learning • 4B • Updated Mar 7 • 3

y-ohtani/GRPO-TCR-Qwen3-4B-True-epoch1

Reinforcement Learning • 4B • Updated Mar 7 • 4

y-ohtani/qwen3-4b-agent-sft-true

Text Generation • 4B • Updated Mar 2 • 15 • 1

y-ohtani/GRPO-TCR-Qwen3-4B-test

Text Generation • 4B • Updated Mar 2 • 5

y-ohtani/GRPO-TCR-Qwen3-4B-quarter

Text Generation • 4B • Updated Mar 1 • 3

y-ohtani/qwen3-4b-grpo-tcr-agent

Text Generation • 4B • Updated Mar 1 • 4 • 2

y-ohtani/Qwen3-4B-Instruct-2507_16-1_global_step_744

Text Generation • 4B • Updated Feb 27 • 1

y-ohtani/qwen3-4b-ra-sft-merged

Text Generation • 4B • Updated Feb 27 • 2

y-ohtani/qwen3-4b-ra-sft-epoch5

Text Generation • 4B • Updated Feb 27 • 5

datasets 2

y-ohtani/open_agentrl_grpo_2k

Viewer • Updated Feb 28 • 2k • 41 • 1

y-ohtani/open_agentrl_like_sft

Viewer • Updated Feb 13 • 2k • 27