2 264 96

Raja Biswas

rbiswasfc

AI & ML interests

NLP, Generative AI

Recent Activity

published a model about 4 hours ago

rbiswasfc/mdc-calib

published a model about 4 hours ago

rbiswasfc/dpc-byt5-large-cpt-final-14k

upvoted a paper about 15 hours ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

View all activity

Organizations

upvoted a paper about 15 hours ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Paper • 2606.23654 • Published 4 days ago • 75

upvoted 2 papers 1 day ago

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

Paper • 2606.07591 • Published 29 days ago • 95

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Paper • 2605.30611 • Published 29 days ago • 247

upvoted a paper 15 days ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published May 19 • 190

upvoted a paper 24 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 28 days ago • 118

upvoted 2 papers 30 days ago

Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering

Paper • 2604.08224 • Published Apr 9 • 53

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 159

upvoted 4 papers about 1 month ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 246

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published Feb 13 • 62

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

Paper • 2601.06789 • Published Jan 11 • 82

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 38

upvoted an article about 1 month ago

Article

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

sergiopaniego, ariG23498

•

May 25

• 120

upvoted a collection about 1 month ago

WebWorld

Collection

4 items • Updated May 11 • 11

upvoted a paper about 1 month ago

NanoResearch: Co-Evolving Skills, Memory, and Policy for Personalized Research Automation

Paper • 2605.10813 • Published May 11 • 16

upvoted a collection 2 months ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 23 items • Updated 14 days ago • 330

upvoted 3 papers 2 months ago

upvoted an article 2 months ago

Article

Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline

nvidia

•

Mar 13

• 40

upvoted a paper 3 months ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 356