🏗️ Building on HF

Anuraag Rath

ChaoticEconomist

7 25 9

https://en.everybodywiki.com/Anuraag_Rath

AI & ML interests

Reinforcement Learning, Game Theoretic Models, LoRA, Agentic Orchestration, RAG

Recent Activity

upvoted a paper 15 days ago

Streaming Communication in Multi-Agent Reasoning

upvoted a paper 15 days ago

Human Psychometric Questionnaires Mischaracterize LLM Behavior

upvoted a paper 15 days ago

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

View all activity

Organizations

upvoted 4 papers 15 days ago

Streaming Communication in Multi-Agent Reasoning

Paper • 2606.05158 • Published 27 days ago • 30

Human Psychometric Questionnaires Mischaracterize LLM Behavior

Paper • 2509.10078 • Published May 29 • 36

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 22 days ago • 54

MiniMax Sparse Attention

Paper • 2606.13392 • Published 19 days ago • 149

upvoted 2 papers 20 days ago

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published 29 days ago • 57

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

Paper • 2606.02404 • Published 29 days ago • 59

upvoted a paper 23 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published May 29 • 120

upvoted 2 papers 24 days ago

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Paper • 2606.02373 • Published 29 days ago • 59

KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks

Paper • 2606.03458 • Published 28 days ago • 67

upvoted 2 papers about 2 months ago

MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval

Paper • 2604.18584 • Published Apr 20 • 15

Efficient Training on Multiple Consumer GPUs with RoundPipe

Paper • 2604.27085 • Published Apr 29 • 47

upvoted a paper 2 months ago

A Mathematical Framework for Custom Reward Functions in Job Application Evaluation using Reinforcement Learning

Paper • 2511.16073 • Published Nov 20, 2025 • 1

upvoted an article 2 months ago

Article

Complete Guide: Training and Inference with π₀.₅ (pi05) on Custom Datasets

Tonic

•

Dec 13, 2025

• 5

upvoted 5 papers 2 months ago

upvoted an article 2 months ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 16

• 73

upvoted a paper 2 months ago

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

Paper • 2604.09459 • Published Apr 13 • 14

Anuraag Rath

AI & ML interests

Recent Activity

Organizations

ChaoticEconomist's activity

Complete Guide: Training and Inference with π₀.₅ (pi05) on Custom Datasets

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers