THE ORB - a galois77 Collection

galois77 's Collections

Thousand brains theory

energy based models

Image generation

Training optimization

Benchmarks and challenges

THE ORB

updated 11 days ago

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

Paper • 2511.08521 • Published Nov 11, 2025 • 39
Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 54
Depth Anything 3: Recovering the Visual Space from Any Views

Paper • 2511.10647 • Published Nov 13, 2025 • 102
VGGT: Visual Geometry Grounded Transformer

Paper • 2503.11651 • Published Mar 14, 2025 • 40
Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13, 2025 • 20
Canvas-to-Image: Compositional Image Generation with Multimodal Controls

Paper • 2511.21691 • Published Nov 26, 2025 • 36
DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders

Paper • 2512.13690 • Published Dec 15, 2025 • 3
SpotEdit: Selective Region Editing in Diffusion Transformers

Paper • 2512.22323 • Published Dec 26, 2025 • 39
AOHP: An Open-Source OS-Level Agent Harness for Personalized, Efficient and Secure Interaction

Paper • 2606.23449 • Published 14 days ago • 33