Efficiently Reconstructing Dynamic Scenes One D4RT at a Time Paper β’ 2512.08924 β’ Published 2 days ago β’ 4
Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality Paper β’ 2512.07951 β’ Published 3 days ago β’ 45
RynnVLA-002: A Unified Vision-Language-Action and World Model Paper β’ 2511.17502 β’ Published 20 days ago β’ 24
Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs Paper β’ 2511.17220 β’ Published 21 days ago β’ 17
Benchmarking Diversity in Image Generation via Attribute-Conditional Human Evaluation Paper β’ 2511.10547 β’ Published 28 days ago β’ 4
UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist Paper β’ 2511.08521 β’ Published about 1 month ago β’ 37
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper β’ 2511.10629 β’ Published 28 days ago β’ 122
Depth Anything 3: Recovering the Visual Space from Any Views Paper β’ 2511.10647 β’ Published 28 days ago β’ 93
Kimi Linear: An Expressive, Efficient Attention Architecture Paper β’ 2510.26692 β’ Published Oct 30 β’ 117
WithAnyone: Towards Controllable and ID Consistent Image Generation Paper β’ 2510.14975 β’ Published Oct 16 β’ 84
Durian: Dual Reference-guided Portrait Animation with Attribute Transfer Paper β’ 2509.04434 β’ Published Sep 4 β’ 10
OpenAI-GPT 20B, 37B ,120B: Neo, reg, uncensored, ablit. Collection OpenAi's model in various sizes and formats, including NEO Imatrix, DI, Tri Matrix, Uncensored, Albiterated, and Brainstorm 20x (37B). β’ 9 items β’ Updated 3 days ago β’ 9
200+ Roleplay, Creative Writing, Uncensored, NSFW models. Collection Oldest models listed first, with Newest models at bottom of the page. Most repos have full examples, instructions, best settings and so on. β’ 344 items β’ Updated about 21 hours ago β’ 385
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper β’ 2508.18265 β’ Published Aug 25 β’ 208
T-LoRA: Single Image Diffusion Model Customization Without Overfitting Paper β’ 2507.05964 β’ Published Jul 8 β’ 119
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers Paper β’ 2506.23918 β’ Published Jun 30 β’ 89