Петров Валентина

sophiajackson20

AI & ML interests

Agent systems for real-world tasks.

Recent Activity

liked a dataset 4 days ago

DCAgent3/gaia_127_rl_think_npfg_code_contests_900s_45_20260526_201919-traces

Organizations

None yet

upvoted a paper about 5 hours ago

OmniVerifier-M1: Multimodal Meta-Verifier with Explicit Structured Recalibration

Paper • 2605.28805 • Published 4 days ago • 9

upvoted a paper 3 days ago

HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents

Paper • 2605.17873 • Published 13 days ago • 11

upvoted 2 papers 9 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 11 days ago • 204

PhysBrain 1.0 Technical Report

Paper • 2605.15298 • Published 17 days ago • 143

upvoted a paper 11 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 19 days ago • 195

upvoted a paper 20 days ago

Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 24 days ago • 231

upvoted a paper 26 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 28 days ago • 166

upvoted 2 papers about 1 month ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published Apr 30 • 217

Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment

Paper • 2604.19548 • Published Apr 21 • 16

upvoted 5 papers about 2 months ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 291

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 189

GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers

Paper • 2604.02648 • Published Apr 3 • 47

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

Paper • 2603.25823 • Published Mar 26 • 43

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 343