XSkill: Continual Learning from Experience and Skills in Multimodal Agents Paper • 2603.12056 • Published 3 days ago • 20
microsoft/llmlingua-2-bert-base-multilingual-cased-meetingbank Token Classification • 0.2B • Updated Jan 8, 2025 • 130k • • 48
UloRL:An Ultra-Long Output Reinforcement Learning Approach for Advancing Large Language Models' Reasoning Abilities Paper • 2507.19766 • Published Jul 26, 2025 • 15
Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 42 items • Updated Jan 25 • 41