Martin Seeler

MartinSeeler

https://martinseeler.com

AI & ML interests

LLM, Conversational UI

Organizations

None yet

upvoted a paper 2 months ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 149

upvoted 2 articles 6 months ago

Article

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

nvidia • Dec 15, 2025 • 111

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate • Dec 4, 2025 • 627

upvoted 2 articles 7 months ago

Article

Supercharge your OCR Pipelines with Open Models

merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq • Oct 21, 2025 • 314

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

edbeeching, kashif, ybelkada, lewtun, lvwerra, nazneen, natolambert • Apr 5, 2023 • 48

upvoted 5 papers 8 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 180

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 550

upvoted a paper 9 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 199

upvoted a paper 11 months ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26, 2025 • 52

upvoted a paper 12 months ago

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27, 2025 • 144

upvoted 2 papers about 1 year ago

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24, 2025 • 124

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Paper • 2503.24388 • Published Mar 31, 2025 • 29

upvoted an article about 1 year ago

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Kseniase • Mar 17, 2025 • 357

upvoted 3 papers over 1 year ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20, 2025 • 175

Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems

Paper • 2502.11098 • Published Feb 16, 2025 • 13

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published Feb 17, 2025 • 46
