Victor Gallego

vicgalle

https://www.vicgalle.net

AI & ML interests

Preference fine-tuning, alignment & synthetic data. Building LLMs in general!

Recent Activity

upvoted a paper about 13 hours ago

Metal-Sci: A Scientific Compute Benchmark for Evolutionary LLM Kernel Search on Apple Silicon

submitted a paper about 13 hours ago

Metal-Sci: A Scientific Compute Benchmark for Evolutionary LLM Kernel Search on Apple Silicon

authored a paper 14 days ago

Discovering Agentic Safety Specifications from 1-Bit Danger Signals

View all activity

Organizations

upvoted a paper about 13 hours ago

Metal-Sci: A Scientific Compute Benchmark for Evolutionary LLM Kernel Search on Apple Silicon

Paper • 2605.09708 • Published 3 days ago • 3

upvoted a paper 14 days ago

Discovering Agentic Safety Specifications from 1-Bit Danger Signals

Paper • 2604.23210 • Published 18 days ago • 4

upvoted a paper 28 days ago

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

Paper • 2604.09459 • Published 30 days ago • 13

upvoted an article about 1 month ago

Article

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

lapp0, LouisCastricato, ScottieFox, shahbuland, xAesthetics

•

Apr 9

• 29

upvoted 2 papers about 2 months ago

STEM Agent: A Self-Adapting, Tool-Enabled, Extensible Architecture for Multi-Protocol AI Agent Systems

Paper • 2603.22359 • Published Mar 22 • 4

Cooperation and Exploitation in LLM Policy Synthesis for Sequential Social Dilemmas

Paper • 2603.19453 • Published Mar 19 • 6

upvoted a changelog about 2 months ago

Hugging Face Changelog

Hugging Face Papers for AI Agents

Mar 18

• 139

upvoted a paper about 2 months ago

AI Scientist via Synthetic Task Scaling

Paper • 2603.17216 • Published Mar 17 • 4

upvoted 2 papers 3 months ago

2Mamba2Furious: Linear in Complexity, Competitive in Accuracy

Paper • 2602.17363 • Published Feb 19 • 8

Experiential Reinforcement Learning

Paper • 2602.13949 • Published Feb 15 • 74

upvoted a paper 4 months ago

Distilling Feedback into Memory-as-a-Tool

Paper • 2601.05960 • Published Jan 9 • 3

upvoted an article 5 months ago

Article

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

nvidia

•

Dec 17, 2025

• 49

upvoted a paper 5 months ago

Agent READMEs: An Empirical Study of Context Files for Agentic Coding

Paper • 2511.12884 • Published Nov 17, 2025 • 28

upvoted a paper 7 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 276

upvoted an article 7 months ago

Article

mem-agent: Equipping LLM Agents with Memory Using RL

driaforall

•

Oct 9, 2025

• 33

upvoted 5 papers 9 months ago

Victor Gallego

AI & ML interests

Recent Activity

Organizations

vicgalle's activity

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

Hugging Face Papers for AI Agents

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

mem-agent: Equipping LLM Agents with Memory Using RL