Yedidia AGNIMO's picture

54

Yedidia AGNIMO

YedsonUQ

·

AI & ML interests

[Uncertainty Quantification, "Hallucinations"] in LLMs, Federated Learning

Organizations

None yet

upvoted 9 papers 5 months ago

Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

Paper • 2601.22636 • Published Jan 30 • 22

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published Feb 5 • 28

Reinforced Attention Learning

Paper • 2602.04884 • Published Feb 4 • 30

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published Feb 6 • 75

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Paper • 2601.22027 • Published Jan 29 • 87

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Paper • 2512.24271 • Published Dec 30, 2025 • 63

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Paper • 2601.17058 • Published Jan 22 • 190

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Paper • 2601.20614 • Published Jan 28 • 119

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 207

upvoted 11 papers 6 months ago

When Reasoning Meets Its Laws

Paper • 2512.17901 • Published Dec 19, 2025 • 62

Deep Research: A Systematic Survey

Paper • 2512.02038 • Published Nov 24, 2025 • 73

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Paper • 2512.22334 • Published Dec 26, 2025 • 37

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

Paper • 2601.01836 • Published Jan 5 • 10

AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents

Paper • 2512.23343 • Published Dec 29, 2025 • 30

Diversity or Precision? A Deep Dive into Next Token Prediction

Paper • 2512.22955 • Published Dec 28, 2025 • 10

Confidence Estimation for LLMs in Multi-turn Interactions

Paper • 2601.02179 • Published Jan 5 • 17

Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

Paper • 2512.23988 • Published Dec 30, 2025 • 19

Recursive Language Models

Paper • 2512.24601 • Published Dec 31, 2025 • 99

Deep Delta Learning

Paper • 2601.00417 • Published Jan 1 • 34

Nested Learning: The Illusion of Deep Learning Architectures

Paper • 2512.24695 • Published Dec 31, 2025 • 46