MLEB Dataset Collection A Large-Scale Benchmark for Legal Text Embeddings • 8 items • Updated 3 days ago • 2
Mecellem Models Collection Project: Turkish Embeddings from Scratch and CPT Decoders Infrastructure: MareNostrum 5 (BSC) • 28 items • Updated 2 days ago • 2
ahmeterdempmk/qwen3-30b-a3b-gdpo-rslora-rlaif-turkish-call-center Text Generation • 31B • Updated 3 days ago • 48
ahmeterdempmk/qwen3-30b-a3b-gdpo-rlaif-turkish-call-center Text Generation • 31B • Updated 5 days ago • 54
ahmeterdempmk/qwen3-4b-gdpo-rlaif-turkish-call-center-16bit-full Text Generation • 4B • Updated 7 days ago • 21
ahmeterdempmk/qwen3-4b-gdpo-rlaif-turkish-call-center-16bit Text Generation • 4B • Updated 7 days ago • 22
ahmeterdempmk/DeepLabV3Plus-With-EfficientNetB4-Encoder-50-Epoch-Polyp-Segmentation-0.9793-IoU Image Segmentation • Updated Feb 28, 2025
ahmeterdempmk/DeepLabV3Plus-with-EfficientNet-B4-150-Epoch-KTO-0.8889-F1-5.38-SMAPE Updated Jul 1, 2025
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 18 days ago • 208