scratchtoscale (Scratch to Scale)

posted an update about 15 hours ago

@retrain-pipelines v0.2.0 is out!
I'm at Station F, at my booth with GOSIM Paris 2026, today & tomorrow.
Come meet me for a live in-person demo and a chat!

woojun-jung

authored a paper 6 days ago

QEVA: A Reference-Free Evaluation Metric for Narrative Video Summarization with Multimodal Question Answering

Paper • 2604.24052 • Published 9 days ago

Aurelien-Morgan

posted an update 23 days ago

Launching a workweek of @retrain-pipelines wheels.

Day #1 : Compose

chansung

authored a paper 3 months ago

TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models

Paper • 2602.15449 • Published Feb 17 • 7

chansung

submitted a paper to Daily Papers 3 months ago

TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models

Paper • 2602.15449 • Published Feb 17 • 7

epappas

authored a paper 5 months ago

Incentivizing Permissionless Distributed Learning of LLMs

Paper • 2505.21684 • Published May 27, 2025 • 1

woojun-jung

authored a paper 5 months ago

Visual Funnel: Resolving Contextual Blindness in Multimodal Large Language Models

Paper • 2512.10362 • Published Dec 11, 2025 • 1

Aurelien-Morgan

posted an update 5 months ago

Hey, I went to Hangzhou to talk about retrain-pipelines at the GOSIM Foundation's conference last September.
The recording was just released. Go check it out!
https://www.youtube.com/watch?v=nmrMachM5aM
The slides are here:
https://docs.google.com/presentation/d/1hnAzHJ0SbeAOtGJir-iH84RBtXT1OxVT/

Saurav2023

published a Space 8 months ago

README

👀

DandinPower

authored 2 papers 9 months ago

MemAscend: System Memory Optimization for SSD-Offloaded LLM Fine-Tuning

Paper • 2505.23254 • Published May 29, 2025

Analysis and Optimized CXL-Attached Memory Allocation for Long-Context LLM Fine-Tuning

Paper • 2507.03305 • Published Jul 4, 2025

chansung

posted an update 10 months ago

YAML engineering is becoming more important than ever, from infra provisioning to model training (recipes).

I built a simple editor for @dstackai first, and I will share the live endpoint this week. Let me know what you think about this approach.

If people find this useful, I am going to do the same thing for the LLM training recipes of popular frameworks such as Hugging Face open-r1, Axolotl, and so on. Let me know.

a1zhang

authored 3 papers 11 months ago

posted an update 12 months ago

Hey, I'll be presenting @retrain-pipelines and almighty function-calling at the Hugging Face Paris HQ, you guys.
Monday evening. Lightning-talk style. With AI Tinkerers.

Come hang out!

https://paris.aitinkerers.org/p/ai-tinkerers-paris-ai21-labs-takeover-on-may-19th

https://huggingface.co/blog/Aurelien-Morgan/the-almighty-function-caller

Aurelien-Morgan

posted an update about 1 year ago

The Almighty function-caller

How would you like to build smart GenAI infrastructure,
give extensive tools memory to your edge agentic system,
and optimize the resources it takes to run a high-performance set of agents?

We came up with a novel approach to function-calling at scale for smart companies and corporate-grade use cases.

Read our full-fledged blog article about it on Hugging Face:
https://huggingface.co/blog/Aurelien-Morgan/the-almighty-function-caller

Aurelien-Morgan

posted an update about 1 year ago

retrain-pipelines 0.1.2 finally dropped. It comes with a hot Hugging Face Hub integration. Go check it out. We have 2 articles about it coming up. One is already fully written, so be on the lookout!
@retrain-pipelines

Also, I'll be volunteering at GOSIM AI Paris 2025. If you're interested in chatting, hmu.

Aurelien-Morgan

posted an update about 1 year ago

Almost there !
https://test.pypi.org/project/test-010-retrain-pipelines/

chansung

posted an update about 1 year ago

A simple guide to the GRPO recipe in Open-R1, which is built on top of TRL.

I think the FastAPI wrapper of vLLM with WeightSyncWorker is a pretty cool feature. Also, we have many predefined reward functions out of the box!

Team members 279

scratchtoscale's activity