DiRL: An Efficient Post-Training Framework for Diffusion Language Models Paper • 2512.22234 • Published 8 days ago • 16
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper • 2512.07525 • Published 23 days ago • 55
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6 • 210
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published Jul 2 • 69
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published Apr 7 • 137
Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2 Text Generation • 8B • Updated Sep 25, 2024 • 115k • • 253