google/timesfm-2.5-200m-transformers Time Series Forecasting • 0.2B • Updated 23 days ago • 10.3k • 37
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 4 days ago • 952k • 204
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated about 1 month ago • 95
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 229
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published Jan 8 • 57