Jarrod Barnes PRO

Jarrodbarnes

https://dynamicalsystems.ai

AI & ML interests

Continual Learning, Reinforcement Learning

Recent Activity

upvoted 3 papers 2 days ago

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published Feb 5 • 28

Bridging the Agent-World Gap: Text World Models for LLM-based Agents

Paper • 2606.09032 • Published 21 days ago • 8

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

Paper • 2606.12191 • Published 19 days ago • 70

upvoted a collection 14 days ago

SWE-FastContext

A family of code-search models powering the Explore subagent for coding agents. • 3 items • Updated 11 days ago • 15

upvoted a collection 21 days ago

Materials

Welcome to IBM’s multi-modal foundation model for materials, FM4M, designed to support and advance research in materials science and chemistry. • 6 items • Updated Jan 28, 2025 • 15

upvoted an article about 1 month ago

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

Article • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego • May 27 • 42

upvoted 2 papers about 1 month ago

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

Paper • 2605.26494 • Published May 26 • 41

Look Before You Leap: Autonomous Exploration for LLM Agents

Paper • 2605.16143 • Published May 15 • 10

upvoted a collection about 1 month ago

📊 DNA benchmarks

Zero-shot DNA benchmarks for Variant Effect prediction, Sequence Recovery and Perturbation tasks. • 5 items • Updated May 19 • 13

upvoted a paper about 1 month ago

Scaling Test-Time Compute for Agentic Coding

Paper • 2604.16529 • Published Apr 16 • 12

upvoted 2 collections about 1 month ago

Laguna XS.2

Designed for agentic coding and long-horizon work on a local machine. Apache 2.0. • 5 items • Updated 8 days ago • 27

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 23 items • Updated 17 days ago • 330

upvoted 3 papers about 2 months ago

ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?

Paper • 2602.19594 • Published Feb 23 • 3

Structured Distillation of Web Agent Capabilities Enables Generalization

Paper • 2604.07776 • Published Apr 9 • 23

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published Jan 20 • 49

upvoted a collection about 2 months ago

💧 LFM2.5

Collection of post-trained and base LFM2.5 models. • 14 items • Updated 3 days ago • 165

upvoted a paper 2 months ago

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Paper • 2604.24954 • Published Apr 27 • 26

upvoted 3 collections 2 months ago

MiMo-V2.5

4 items • Updated Apr 27 • 90

SAM3

6 items • Updated Mar 26 • 291

Qwen3.6

4 items • Updated Apr 22 • 419