World Action Models: The Next Frontier in Embodied AI Paper • 2605.12090 • Published May 12 • 68
The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping Paper • 2604.11297 • Published Apr 13 • 144
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning Paper • 2603.04918 • Published Mar 5 • 56
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models Paper • 2602.10934 • Published Feb 11 • 50
OpenMOSS-Team/MOSS-Audio-Tokenizer Image Feature Extraction • 2B • Updated 22 days ago • 242k • 46