Hiroshi Yoshihara

RabotniKuma

AI & ML interests

None yet

Recent Activity

liked a dataset 6 days ago

genshiai-daichi/med-slm-ja-before-after

Viewer • Updated 23 days ago • 46.7k • 113 • 2

liked a dataset about 2 months ago

SakanaAI/FishMath-SFT-Data

Viewer • Updated May 8 • 23.3k • 75 • 3

liked a model 2 months ago

sbintuitions/sarashina2.2-tts

Text-to-Speech • 0.8B • Updated 6 days ago • 59.6k • 63

upvoted a paper 4 months ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published Mar 19 • 70

liked 2 models 9 months ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.44M • 3.3k

unsloth/gpt-oss-20b-BF16

Text Generation • 21B • Updated Aug 5, 2025 • 95k • 34

liked a model 10 months ago

Qwen/Qwen3-Next-80B-A3B-Instruct

Text Generation • 81B • Updated Sep 17, 2025 • 246k • 1.03k

authored a paper 11 months ago

A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning

Paper • 2507.08267 • Published Jul 11, 2025 • 11

liked a model 11 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 6.96M • 4.77k

upvoted 2 papers 12 months ago

Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search

Paper • 2503.04412 • Published Mar 6, 2025 • 6

A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning

Paper • 2507.08267 • Published Jul 11, 2025 • 11

updated a collection 12 months ago

Fast-Math

Fast-Math is a model series designed to significantly improve inference efficiency while preserving accuracy on math reasoning tasks. • 7 items • Updated Jul 15, 2025

New activity in RabotniKuma/Fast-Math-R1-Token-Scheduler 12 months ago

Improve dataset card for Token Scheduler Dataset: Add paper link and detailed description

#2 opened 12 months ago by

New activity in RabotniKuma/Fast-Math-R1-GRPO 12 months ago

Improve dataset card: Add paper, code, metadata, and usage

#1 opened 12 months ago by

New activity in RabotniKuma/Fast-Math-R1-SFT 12 months ago

Enhance dataset card: Add paper link, metadata, and usage

#2 opened 12 months ago by

New activity in RabotniKuma/Fast-Math-Qwen3-14B 12 months ago

Improve model card: Add pipeline tag, library name, and link to paper

#1 opened 12 months ago by

New activity in RabotniKuma/Fast-OpenMath-Nemotron-14B 12 months ago

Enhance model card with metadata, paper link, and project page

#1 opened 12 months ago by

New activity in RabotniKuma/Fast-Math-R1-14B 12 months ago

Improve model card: Add pipeline tag, library name, update paper link and enhance details

#1 opened 12 months ago by

liked a model 12 months ago

marcelbinz/Llama-3.1-Centaur-70B-adapter

Updated Jul 1, 2025 • 177

updated a model about 1 year ago

RabotniKuma/Fast-Math-R1-14B-2

15B • Updated May 20, 2025 • 3