Pratyay Banerjee's picture

🔄 In a Training Loop

Pratyay Banerjee

Neilblaze

·

https://neilblaze.live

AI & ML interests

IR, NLP, Pattern Recognition, xAI, Interpretability, Evals

Recent Activity

liked a model 3 days ago

google/medgemma-1.5-4b-it

upvoted a paper 3 days ago

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

upvoted a paper 3 days ago

FastContext: Training Efficient Repository Explorer for Coding Agents

View all activity

Organizations

upvoted 5 papers 3 days ago

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

Paper • 2606.06036 • Published 21 days ago • 73

FastContext: Training Efficient Repository Explorer for Coding Agents

Paper • 2606.14066 • Published 13 days ago • 91

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

Paper • 2606.11926 • Published 15 days ago • 117

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 14 days ago • 140

Looped World Models

Paper • 2606.18208 • Published 9 days ago • 457

upvoted 6 papers 6 days ago

Measuring Epistemic Resilience of LLMs Under Misleading Medical Context

Paper • 2606.12291 • Published 15 days ago • 58

Learning from the Self-future: On-policy Self-distillation for dLLMs

Paper • 2606.18195 • Published 9 days ago • 74

Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding?

Paper • 2606.08063 • Published 19 days ago • 79

Redesign Mixture-of-Experts Routers with Manifold Power Iteration

Paper • 2606.12397 • Published 15 days ago • 87

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 10 days ago • 112

Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings

Paper • 2606.07502 • Published 20 days ago • 97

upvoted an article 12 days ago

Article

Build Small Hackathon With Cohere Models

CohereLabs

•

20 days ago

• 5

upvoted 7 papers 13 days ago

OpenSkill: Open-World Self-Evolution for LLM Agents

Paper • 2606.06741 • Published 21 days ago • 28

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 17 days ago • 33

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 17 days ago • 52

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 20 days ago • 67

LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

Paper • 2606.06087 • Published 21 days ago • 64

KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks

Paper • 2606.03458 • Published 23 days ago • 65

Agents' Last Exam

Paper • 2606.05405 • Published 22 days ago • 360

upvoted an article 14 days ago

Article

Introducing North Mini Code: Cohere’s First Model For Developers

CohereLabs

•

15 days ago

• 74