DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 3 days ago • 93
OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization Paper • 2605.21226 • Published 3 days ago • 8
Prior-Aligned Data Cleaning for Tabular Foundation Models Paper • 2604.25154 • Published 25 days ago • 4
DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios Paper • 2604.25914 • Published 25 days ago • 41
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published about 1 month ago • 240
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
electricsheepafrica/africa-world-bank-trade-indicators-for-niger Viewer • Updated Apr 12 • 5.31k • 108 • 1
An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU Paper • 2603.16428 • Published Mar 17 • 51