TAB-PO: Preference Optimization with a Token-Level Adaptive Barrier for Token-Critical Structured Generation Paper • 2603.00025 • Published Feb 3 • 1
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 452