Collections
Discover the best community collections!
Collections trending this week
-
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Paper • 2603.19220 • Published • 69 -
Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR
Paper • 2605.20164 • Published • 6 -
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment
Paper • 2605.19577 • Published • 59 -
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL
Paper • 2605.18703 • Published • 50
-
Hibiki Zero Samples
🏆13Demo samples of the speech translation model Hibiki-Zero.
-
Simultaneous Speech-to-Speech Translation Without Aligned Data
Paper • 2602.11072 • Published • 2 -
kyutai/Audio-NTREX-4L
Viewer • Updated • 3.6k • 744 • 5 -
kyutai/hibiki-zero-3b-pytorch-bf16
Audio-to-Audio • Updated • 2.09k • 56
-
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research
Paper • 2606.07591 • Published • 90 -
InternScience/ResearchClawBench
Benchmark • Updated • 57 • 3.71k • 6 -
ResearchHarness
🚀1Lightweight harness for tool-using LLM agents.
-
ResearchClawBench Task Submission
📦Submit and validate a ResearchClawBench task ZIP
-
dealignai/MiniMax-M2.7-JANGTQ-CRACK
Text Generation • 15B • Updated • 1.51k • 27 -
dealignai/Nemotron-3-Super-120B-A12B-UNCENSORED-JANG_2L
Text Generation • 13B • Updated • 2.81k • 11 -
dealignai/Gemma-4-26B-A4B-JANG_2L-CRACK
Image-Text-to-Text • 3B • Updated • 3.04k • 86 -
dealignai/Qwen3.5-VL-397B-A17B-UNCENSORED-JANG_1L
Image-Text-to-Text • 34B • Updated • 552 • 5
-
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 82 -
Scaling Latent Reasoning via Looped Language Models
Paper • 2510.25741 • Published • 231 -
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Paper • 2502.05171 • Published • 157 -
Pretraining Language Models to Ponder in Continuous Space
Paper • 2505.20674 • Published • 3
-
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research
Paper • 2606.07591 • Published • 90 -
InternScience/ResearchClawBench
Benchmark • Updated • 57 • 3.71k • 6 -
ResearchHarness
🚀1Lightweight harness for tool-using LLM agents.
-
ResearchClawBench Task Submission
📦Submit and validate a ResearchClawBench task ZIP
-
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Paper • 2603.19220 • Published • 69 -
Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR
Paper • 2605.20164 • Published • 6 -
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment
Paper • 2605.19577 • Published • 59 -
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL
Paper • 2605.18703 • Published • 50
-
dealignai/MiniMax-M2.7-JANGTQ-CRACK
Text Generation • 15B • Updated • 1.51k • 27 -
dealignai/Nemotron-3-Super-120B-A12B-UNCENSORED-JANG_2L
Text Generation • 13B • Updated • 2.81k • 11 -
dealignai/Gemma-4-26B-A4B-JANG_2L-CRACK
Image-Text-to-Text • 3B • Updated • 3.04k • 86 -
dealignai/Qwen3.5-VL-397B-A17B-UNCENSORED-JANG_1L
Image-Text-to-Text • 34B • Updated • 552 • 5
-
Hibiki Zero Samples
🏆13Demo samples of the speech translation model Hibiki-Zero.
-
Simultaneous Speech-to-Speech Translation Without Aligned Data
Paper • 2602.11072 • Published • 2 -
kyutai/Audio-NTREX-4L
Viewer • Updated • 3.6k • 744 • 5 -
kyutai/hibiki-zero-3b-pytorch-bf16
Audio-to-Audio • Updated • 2.09k • 56
-
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 82 -
Scaling Latent Reasoning via Looped Language Models
Paper • 2510.25741 • Published • 231 -
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Paper • 2502.05171 • Published • 157 -
Pretraining Language Models to Ponder in Continuous Space
Paper • 2505.20674 • Published • 3