COrigami: An AI Pipeline for Co-Designing Flat-Foldable Visually Recognisable Origami Paper • 2606.26299 • Published 4 days ago • 4
PhysiFormer: Learning to Simulate Mechanics in World Space Paper • 2606.27364 • Published 3 days ago • 9
Article seemore: Implement a Vision Language Model from Scratch AviSoori1x • Jun 23, 2024 • 110
Article SeeMoE: Implementing a MoE Vision Language Model from Scratch AviSoori1x • Jun 23, 2024 • 40
RL-Index: Reinforcement Learning for Retrieval Index Reasoning Paper • 2606.16316 • Published 13 days ago • 5
CAVEWOMAN: How Large Language Models Behave Under Linguistic Input and Output Compression Paper • 2606.24083 • Published 5 days ago • 4
Look Light, Think Heavy: What Multimodal Chain-of-Thought Reasoning Can and Cannot Do Paper • 2606.22565 • Published 7 days ago • 8
Autodata: An agentic data scientist to create high quality synthetic data Paper • 2606.25996 • Published 4 days ago • 10
EBench: Elemental Diagnosis of Generalist Mobile Manipulation Policies Paper • 2606.18239 • Published 8 days ago • 15
Beyond NL2Code: A Structured Survey of Multimodal Code Intelligence Paper • 2606.15932 • Published 12 days ago • 36
DREAM: Dense Retrieval Embeddings via Autoregressive Modeling Paper • 2606.24667 • Published 5 days ago • 4
DINOv3-Diffusion Policy: Self-Supervised Large Visual Model for Visuomotor Diffusion Policy Learning Paper • 2509.17684 • Published Sep 22, 2025 • 1
Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution Paper • 2606.06492 • Published 24 days ago • 94
From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning Paper • 2606.17682 • Published 12 days ago • 26