Victor Gallego

vicgalle

https://www.vicgalle.net

AI & ML interests

Preference fine-tuning, alignment & synthetic data. Building LLMs in general!

Recent Activity

authored a paper 10 days ago

Discovering Agentic Safety Specifications from 1-Bit Danger Signals

Paper • 2604.23210 • Published 13 days ago • 4

submitted a paper to Daily Papers 10 days ago

Discovering Agentic Safety Specifications from 1-Bit Danger Signals

Paper • 2604.23210 • Published 13 days ago • 4

upvoted a paper 10 days ago

Discovering Agentic Safety Specifications from 1-Bit Danger Signals

Paper • 2604.23210 • Published 13 days ago • 4

liked a model 14 days ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 2 days ago • 1.06M • 3.74k

liked a model 16 days ago

openai/privacy-filter

Token Classification • 1B • Updated 16 days ago • 173k • 1.36k

upvoted a paper 24 days ago

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

Paper • 2604.09459 • Published 25 days ago • 13

upvoted an article 28 days ago

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

Article • 29 days ago

liked a dataset 28 days ago

mishig/autoresearch-optimizer-findings

Updated 28 days ago • 204 • 8

liked a model about 1 month ago

chromadb/context-1

Text Generation • 21B • Updated Mar 30 • 1.67k • 401

upvoted a paper about 1 month ago

STEM Agent: A Self-Adapting, Tool-Enabled, Extensible Architecture for Multi-Protocol AI Agent Systems

Paper • 2603.22359 • Published Mar 22 • 4

submitted a paper to Daily Papers about 1 month ago

STEM Agent: A Self-Adapting, Tool-Enabled, Extensible Architecture for Multi-Protocol AI Agent Systems

Paper • 2603.22359 • Published Mar 22 • 4

authored a paper about 2 months ago

Cooperation and Exploitation in LLM Policy Synthesis for Sequential Social Dilemmas

Paper • 2603.19453 • Published Mar 19 • 6

submitted a paper to Daily Papers about 2 months ago

Cooperation and Exploitation in LLM Policy Synthesis for Sequential Social Dilemmas

Paper • 2603.19453 • Published Mar 19 • 6

upvoted a paper about 2 months ago

Cooperation and Exploitation in LLM Policy Synthesis for Sequential Social Dilemmas

Paper • 2603.19453 • Published Mar 19 • 6

upvoted a changelog about 2 months ago

Hugging Face Papers for AI Agents

Hugging Face Changelog • Mar 18 • 139

upvoted a paper about 2 months ago

AI Scientist via Synthetic Task Scaling

Paper • 2603.17216 • Published Mar 17 • 4

updated a dataset about 2 months ago

vicgalle/rubric-feedback-bench

Viewer • Updated Mar 17 • 42 • 62 • 1

liked 2 models about 2 months ago

mistralai/Mistral-Small-4-119B-2603

119B • Updated 11 days ago • 63.1k • 372

mistralai/Leanstral-2603

Updated 17 days ago • 193 • 155

liked a dataset about 2 months ago

SAIRfoundation/equational-theories-selected-problems

Viewer • Updated 10 days ago • 2.67k • 2.22k • 10
