ML Foundations Development

non-profit

https://github.com/mlfoundations

AI & ML interests

None defined yet.

Recent Activity

Jaehun authored a paper about 19 hours ago

LLM-as-a-Tutor: Policy-Aware Prompt Adaptation for Non-Verifiable RL

Jaehun authored a paper about 19 hours ago

How to Instruct Your Robot: Dense Language Annotations Power Robot Policy Learning

Jaehun authored a paper about 19 hours ago

ProCUA-SFT Technical Report

View all activity

authored 7 papers about 19 hours ago

LLM-as-a-Tutor: Policy-Aware Prompt Adaptation for Non-Verifiable RL

Paper • 2607.04412 • Published 16 days ago • 34

How to Instruct Your Robot: Dense Language Annotations Power Robot Policy Learning

Paper • 2605.17077 • Published May 16 • 1

ProCUA-SFT Technical Report

Paper • 2606.17321 • Published Jun 15 • 10

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published Jun 1 • 140

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Paper • 2604.24954 • Published Apr 27 • 26

Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch

Paper • 2602.03183 • Published Feb 3 • 13

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

authored a paper 24 days ago

OpenThoughts-Agent: Data Recipes for Agentic Models

Paper • 2606.24855 • Published 28 days ago • 47

authored a paper 26 days ago

OpenThoughts-Agent: Data Recipes for Agentic Models

Paper • 2606.24855 • Published 28 days ago • 47

published 6 datasets about 2 months ago

mlfoundations-dev/sachin_khanacademy_web_scraped_questions

Updated Oct 25, 2024 • 118

mlfoundations-dev/slim-orca

Viewer • Updated Oct 28, 2024 • 260k • 59

mlfoundations-dev/subsampled_flan_v2_w_system_instructions

Viewer • Updated Oct 14, 2024 • 3.56M • 41

mlfoundations-dev/subsampled_flan_v2

Viewer • Updated Oct 10, 2024 • 3.51M • 21

mlfoundations-dev/TIGER-Lab-WebInstructFull-copy

Viewer • Updated Oct 20, 2024 • 11.6M • 20

mlfoundations-dev/open-orca

Preview • Updated Oct 18, 2024 • 7

authored 5 papers 3 months ago

DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning

Paper • 2503.19263 • Published Mar 25, 2025 • 2

Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

Paper • 2504.09763 • Published Apr 14, 2025 • 12

OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4, 2025 • 56

One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration

Paper • 2510.12088 • Published Oct 14, 2025 • 5

PRInTS: Reward Modeling for Long-Horizon Information Seeking

Paper • 2511.19314 • Published Nov 24, 2025 • 8