Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights Paper • 2603.12228 • Published 10 days ago • 11
AutoHarness: improving LLM agents by automatically synthesizing a code harness Paper • 2603.03329 • Published Feb 10 • 1
AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games Paper • 2602.17594 • Published about 1 month ago • 9
Fast Inference from Transformers via Speculative Decoding Paper • 2211.17192 • Published Nov 30, 2022 • 11
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools Paper • 2405.20362 • Published May 30, 2024 • 3
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 80 items • Updated 11 days ago • 485
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning Paper • 2602.21534 • Published 25 days ago • 23
Demystifying Reinforcement Learning in Agentic Reasoning Paper • 2510.11701 • Published Oct 13, 2025 • 33
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2, 2025 • 236
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 24 days ago • 88
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies Paper • 2602.09877 • Published Feb 10 • 197
BrowseComp-V^3: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents Paper • 2602.12876 • Published Feb 13 • 10