yotoshihiro


AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

Unlocking asynchronicity in continuous batching

upvoted an article 2 months ago

DeepSeek-V4: a million-token context that agents can actually use

liked a Space 2 months ago

AdithyaSK/rl-environments-guide


Organizations

upvoted an article about 2 months ago

Article

Unlocking asynchronicity in continuous batching

ror, pcuenq, ariG23498 • May 14 • 63

upvoted an article 2 months ago

Article

DeepSeek-V4: a million-token context that agents can actually use

burtenshaw • Apr 24 • 50

liked a Space 2 months ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

198

Building and scaling RL environments for LLM training

upvoted a collection 2 months ago

DFlash

Collection

Block Diffusion for Flash Speculative Decoding • 23 items • Updated 16 days ago • 147

upvoted an article 5 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 188

liked 2 Spaces 7 months ago

The Eiffel Tower Llama

📝

119

Explore the Eiffel Tower Llama experiment with open-source models

Evaluation Guidebook

📝

334

Explore LLM benchmark scores over time

liked a Space 9 months ago

The Smol Training Playbook

📚

3.24k

The secrets to building world-class LLMs

liked a dataset 11 months ago

miromind-ai/MiroVerse-v0.1

Viewer • Updated Jan 16 • 228k • 226 • 238

upvoted an article 11 months ago

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

smohammadi, siro1, winglian, marcsun13, djsaunde • Aug 8, 2025 • 99

liked a Space 12 months ago

LLM Embeddings Explained: A Visual and Intuitive Guide

🚀

356

How Language Models Turn Text into Meaning, From Traditional

upvoted 2 articles about 1 year ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

qgallouedec • Apr 18, 2025 • 72

Article

Training Large Language Models with Interpreter Feedback using WebAssembly

axolotl-ai-co • Apr 3, 2025 • 14

liked a model over 1 year ago

allenai/olmOCR-7B-0225-preview

Image-Text-to-Text • 8B • Updated Aug 19, 2025 • 2.46k • 708

upvoted an article over 1 year ago

Article

What is test-time compute and how to scale it?

Kseniase • Feb 6, 2025 • 123

liked a model over 1 year ago

unsloth/r1-1776-GGUF

Text Generation • 671B • Updated Feb 19, 2025 • 568 • 103

liked 2 Spaces over 1 year ago

The Ultra-Scale Playbook

🌌

3.94k

The ultimate guide to training LLM on large GPU Clusters

AnyCoder

🏆

3.3k

Generate code snippets with AI for web and app frameworks

liked a model over 1 year ago

deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27, 2025 • 1.07M • 4.1k

upvoted a paper over 1 year ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13, 2025 • 100
