AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts Paper • 2601.11044 • Published 10 days ago • 34
Dr. Zero: Self-Evolving Search Agents without Training Data Paper • 2601.07055 • Published 15 days ago • 20
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 393
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published 27 days ago • 141
Is There a Better Source Distribution than Gaussian? Exploring Source Distributions for Image Flow Matching Paper • 2512.18184 • Published Dec 20, 2025 • 21
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published Dec 11, 2025 • 49
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2512.20848 • Published Dec 23, 2025 • 35
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 251
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 178
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 116
A Survey of Vibe Coding with Large Language Models Paper • 2510.12399 • Published Oct 14, 2025 • 50
Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning Paper • 2512.10534 • Published Dec 11, 2025 • 32
One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation Paper • 2512.07829 • Published Dec 8, 2025 • 22
view article Article MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier Dec 12, 2025 • 21
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning Paper • 2511.18659 • Published Nov 24, 2025 • 21
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 294