World Pilot: Steering Vision-Language-Action Models with World-Action Priors Paper • 2606.12403 • Published 20 days ago • 26
ICA Lens: Interpreting Language Models Without Training Another Dictionary Paper • 2606.11722 • Published 20 days ago • 15
Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts Paper • 2606.05922 • Published 25 days ago • 69
Towards Long-Horizon Interpretability: Efficient and Faithful Multi-Token Attribution for Reasoning LLMs Paper • 2602.01914 • Published Feb 2 • 1
A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory Paper • 2510.02373 • Published Sep 29, 2025 • 10
view article Article Llama can now see and run on your device - welcome Llama 3.2 +5 merve, philschmid, osanseviero, reach-vb, lewtun, ariG23498, pcuenq • Sep 25, 2024 • 191
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 675
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 975
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22, 2024 • 262
My Dataset Suite Collection I create these bilingual fine-tuning / dpo dataset for easier creation of En-Zh models • 3 items • Updated Apr 1, 2024 • 2
Faro Series Collection Faro chat models are fine-tuned on Fusang, focusing on practicality and long-context modeling. • 6 items • Updated Apr 11, 2024 • 3
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 53 items • Updated Mar 2 • 214