Retrieve, Don't Retrain: Extending Vision Language Action Models to New Tasks at Test Time Paper • 2606.15631 • Published 11 days ago • 16
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published Oct 22, 2025 • 117
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 711
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12, 2024 • 73
view article Article Expanding Model Context and Creating Chat Models with a Single Click maywell • Apr 28, 2024 • 38