Paul S PRO
SuperPauly
AI & ML interests
None yet
Recent Activity
liked a model about 13 hours ago
owensong/Inflect-Nano-v1 new activity 2 days ago
Zeyue7/Audio-Omni:Demo is broke liked a model 2 days ago
HKUSTAudio/AudioX-TurboOrganizations
None yet
Sample Upscaling & Denoising.
Evaluation Methods & Metrics
-
RubricBench: Aligning Model-Generated Rubrics with Human Standards
Paper • 2603.01562 • Published • 64 -
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning
Paper • 2603.03790 • Published • 122 -
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents
Paper • 2505.20411 • Published • 97 -
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale
Paper • 2602.23866 • Published • 91
Py
Pre-Training Tokens
-
HRM-Text: Efficient Pretraining Beyond Scaling
Paper • 2605.20613 • Published • 319 -
LoopUS: Recasting Pretrained LLMs into Looped Latent Refinement Models
Paper • 2605.11011 • Published • 10 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 517 -
Characterizing, Evaluating, and Optimizing Complex Reasoning
Paper • 2602.08498 • Published
Demixing Models & Datasets
-
Moisesdb: A dataset for source separation beyond 4-stems
Paper • 2307.15913 • Published • 1 -
Music Source Separation with Band-Split RoPE Transformer
Paper • 2309.02612 • Published • 2 -
Hybrid Transformers for Music Source Separation
Paper • 2211.08553 • Published • 1 -
nvidia/RE-USE
Audio-to-Audio • 9.61M • Updated • 5.9k • 77
Agent Loops, Character, Work Ethics & Behavior
-
Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing
Paper • 2512.23611 • Published • 7 -
Context as a Tool: Context Management for Long-Horizon SWE-Agents
Paper • 2512.22087 • Published • 4 -
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications
Paper • 2508.16279 • Published • 66 -
Very Large-Scale Multi-Agent Simulation in AgentScope
Paper • 2407.17789 • Published • 44
Music
Pre-Training Tokens
-
HRM-Text: Efficient Pretraining Beyond Scaling
Paper • 2605.20613 • Published • 319 -
LoopUS: Recasting Pretrained LLMs into Looped Latent Refinement Models
Paper • 2605.11011 • Published • 10 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 517 -
Characterizing, Evaluating, and Optimizing Complex Reasoning
Paper • 2602.08498 • Published
Sample Upscaling & Denoising.
Demixing Models & Datasets
-
Moisesdb: A dataset for source separation beyond 4-stems
Paper • 2307.15913 • Published • 1 -
Music Source Separation with Band-Split RoPE Transformer
Paper • 2309.02612 • Published • 2 -
Hybrid Transformers for Music Source Separation
Paper • 2211.08553 • Published • 1 -
nvidia/RE-USE
Audio-to-Audio • 9.61M • Updated • 5.9k • 77
Evaluation Methods & Metrics
-
RubricBench: Aligning Model-Generated Rubrics with Human Standards
Paper • 2603.01562 • Published • 64 -
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning
Paper • 2603.03790 • Published • 122 -
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents
Paper • 2505.20411 • Published • 97 -
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale
Paper • 2602.23866 • Published • 91
Agent Loops, Character, Work Ethics & Behavior
-
Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing
Paper • 2512.23611 • Published • 7 -
Context as a Tool: Context Management for Long-Horizon SWE-Agents
Paper • 2512.22087 • Published • 4 -
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications
Paper • 2508.16279 • Published • 66 -
Very Large-Scale Multi-Agent Simulation in AgentScope
Paper • 2407.17789 • Published • 44
Py