tencent/Penguin-VL-8B
Text Generation • 9B • 417 • 74
A$^2$TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping
MiA-Signature: Approximating Global Activation for Long-Context Understanding