7 99 47

Denis Akhiyarov

dtanow

AI & ML interests

AI Code Generation with LLMs

Recent Activity

upvoted a paper 13 days ago

MiniMax Sparse Attention

upvoted a paper 15 days ago

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

upvoted an article 16 days ago

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

View all activity

Organizations

upvoted a paper 13 days ago

MiniMax Sparse Attention

Paper • 2606.13392 • Published 15 days ago • 146

upvoted a paper 15 days ago

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Paper • 2606.07297 • Published 21 days ago • 119

upvoted an article 16 days ago

Article

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

ServiceNow-AI

•

16 days ago

• 44

liked a Space 24 days ago

MTEB Leaderboard

📊

7.5k

Embedding Leaderboard

upvoted a paper 27 days ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 134

upvoted a paper 28 days ago

Hyperagents

Paper • 2603.19461 • Published Mar 19 • 51

upvoted 3 papers about 1 month ago

Code as Agent Harness

Paper • 2605.18747 • Published May 18 • 223

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 75

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 65

liked a model about 2 months ago

nvidia/llama-nemotron-embed-vl-1b-v2

upvoted an article about 2 months ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 614

upvoted a paper 2 months ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 328

upvoted 2 papers 3 months ago

Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning

Paper • 2604.02007 • Published Apr 2 • 14

Therefore I am. I Think

Paper • 2604.01202 • Published Apr 2 • 33

submitted a paper to Daily Papers 3 months ago

Therefore I am. I Think

Paper • 2604.01202 • Published Apr 2 • 33

upvoted 2 papers 3 months ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 98

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 99

liked a dataset 3 months ago

ServiceNow-AI/eva

Viewer • Updated Mar 24 • 50 • 74 • 71

upvoted an article 3 months ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI

•

Mar 24

• 95

upvoted a paper 3 months ago

Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck

Paper • 2603.08462 • Published Mar 9 • 23

Denis Akhiyarov

AI & ML interests

Recent Activity

Organizations

dtanow's activity

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

MTEB Leaderboard

Vision Language Models (Better, faster, stronger)

A New Framework for Evaluating Voice Agents (EVA)