andreydelpozo's Collections explorations
teknium/OpenHermes-2.5-Mistral-7B
Text Generation
• Updated
• 151k
• 887
Text-to-Image
• Updated
• 34.9k
• 2.13k
Text Generation
• 9B • Updated
• 53.8k
• 1.24k
dphn/dolphin-2.2.1-mistral-7b
Text Generation
• 7B • Updated
• 852
• 198
dphn/dolphin-2.5-mixtral-8x7b
Text Generation
• 47B • Updated
• 1.58k
• 1.24k
dphn/dolphin-2.6-mistral-7b-dpo-laser
Text Generation
• 7B • Updated
• 280
• 120
ise-uiuc/Magicoder-Evol-Instruct-110K
Viewer
• Updated
• 111k • 2.92k
• 173
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe
Interpolation
Paper
• 2408.15239
• Published
• 30
WebShaper: Agentically Data Synthesizing via Information-Seeking
Formalization
Paper
• 2507.15061
• Published
• 60
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation
Paper
• 2510.01284
• Published
• 37
OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot
Paper
• 2510.06751
• Published
• 22
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement
Learning
Paper
• 2509.24372
• Published
• 11
Paper
• 2508.10104
• Published
• 298
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
Paper
• 2510.07310
• Published
• 36
Real-Time Object Detection Meets DINOv3
Paper
• 2509.20787
• Published
• 11
A Survey of Reinforcement Learning for Large Reasoning Models
Paper
• 2509.08827
• Published
• 190
A Survey of Context Engineering for Large Language Models
Paper
• 2507.13334
• Published
• 261
Scaling RL to Long Videos
Paper
• 2507.07966
• Published
• 160
T-LoRA: Single Image Diffusion Model Customization Without Overfitting
Paper
• 2507.05964
• Published
• 120
SingLoRA: Low Rank Adaptation Using a Single Matrix
Paper
• 2507.05566
• Published
• 115
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM
Fine-Tuning Data from Unstructured Documents
Paper
• 2507.04009
• Published
• 54
Radial Attention: O(n log n) Sparse Attention with Energy Decay for
Long Video Generation
Paper
• 2506.19852
• Published
• 42
KV Cache Steering for Inducing Reasoning in Small Language Models
Paper
• 2507.08799
• Published
• 40
PartCrafter: Structured 3D Mesh Generation via Compositional Latent
Diffusion Transformers
Paper
• 2506.05573
• Published
• 82
Qwen3 Embedding: Advancing Text Embedding and Reranking Through
Foundation Models
Paper
• 2506.05176
• Published
• 79
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation
Paper
• 2506.09790
• Published
• 53
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Paper
• 2506.07491
• Published
• 50
Paper
• 2505.09388
• Published
• 338
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture,
Training and Dataset
Paper
• 2505.09568
• Published
• 99
Flow-GRPO: Training Flow Matching Models via Online RL
Paper
• 2505.05470
• Published
• 88
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Paper
• 2505.17612
• Published
• 81
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
Paper
• 2505.04588
• Published
• 65
dx8152/Qwen-Edit-2509-Multiple-angles
Image-to-Image
• Updated
• 90.8k
• 912
Qwen3-TTS Technical Report
Paper
• 2601.15621
• Published
• 70