Article BigCodeArena: Judging code generations end to end with code executions Oct 7, 2025 • 19
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9, 2025 • 37
Privacy-Preserving Tabular Synthetic Data Generation Using TabularARGN Paper • 2508.06647 • Published Aug 8, 2025 • 17
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data Paper • 2501.12012 • Published Jan 21, 2025 • 9
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 436
Article I Clicked “I Agree”, But What Am I Really Consenting To? Mar 26, 2025 • 24
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 • 484
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism Paper • 2407.10457 • Published Jul 15, 2024 • 24
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer Paper • 2403.13570 • Published Mar 20, 2024 • 3
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 96