IAPO Checkpoints of paper "IAPO: Information-Aware Policy Optimization for Token-Efficient Reasoning" Collection by jonathanhe123 Feb 22 2
ColBERT-Zero 🐶 First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT Collection by lightonai 22 days ago 23 ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published Feb 18 • 9 lightonai/ColBERT-Zero Sentence Similarity • 0.1B • Updated Feb 23 • 2.86k • • 41 lightonai/ColBERT-Zero-supervised Sentence Similarity • 0.1B • Updated Feb 23 • 60 • 3 lightonai/ColBERT-Zero-unsupervised Sentence Similarity • 0.1B • Updated Feb 23 • 196 • 2
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published Feb 18 • 9
Coding Agents Collection by openenv Feb 20 1 Running RL 14 Coding Environment Server 💻 14 Run Python code step-by-step in a coding environment Sleeping RL TB2 Environment Server 🧪 Interact with an OpenEnv agentic environment Runtime error Julia_env Environment Server 🐳 Infatoshi/kernrl-training Reinforcement Learning • Updated Jan 20
Mobile-O-Datasets This collection includes the pre-training, sft, and post-training data of Mobile-O Collection by Amshaker Feb 14 4 Amshaker/Mobile-O-Pre-Train Viewer • Updated Feb 24 • 22.8M • 4.83k • 11 Amshaker/Mobile-O-SFT Viewer • Updated Feb 24 • 7.11k • 265 • 5 Amshaker/Mobile-O-Post-Train Viewer • Updated Feb 24 • 7k • 455 • 13
coding Collection by darxtrix Feb 11 2 SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published Feb 2 • 61 LongCodeZip: Compress Long Context for Code Language Models Paper • 2510.00446 • Published Oct 1, 2025 • 108 Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published Feb 10 • 201 Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Paper • 2601.11868 • Published Jan 17 • 37
SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published Feb 2 • 61
LongCodeZip: Compress Long Context for Code Language Models Paper • 2510.00446 • Published Oct 1, 2025 • 108
Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published Feb 10 • 201
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Paper • 2601.11868 • Published Jan 17 • 37
GUI-Libra Training GUI agents with augmented reasoning data and a tailored post-training recipe Collection by Ray2333 Apr 25 1 Ray2333/GUI-Libra-3B 4B • Updated Mar 1 • 3 Ray2333/Libra-81K-SFT Updated Mar 31 • 93 • 1 Ray2333/Offline_Evaluation Viewer • Updated Feb 22 • 35.2k • 17 Ray2333/Libra-81K Viewer • Updated Feb 20 • 738 • 33 • 1
Opensource datasets Collection by hivetrace Mar 11 3 hivetrace/prompt-2-prompt-injection-v2-dataset-ru Viewer • Updated Feb 11 • 22.1k • 271 • 3
Mobile-O-Models This collection contains all models of Mobile-O project Collection by Amshaker Feb 20 3 Amshaker/Mobile-O-0.5B Text-to-Image • 2B • Updated Feb 24 • 343 • 11 Amshaker/Mobile-O-1.5B Text-to-Image • 4B • Updated Feb 24 • 181 • 11 Amshaker/Mobile-O-0.5B-iOS Image-Text-to-Text • Updated 23 days ago • 220 • 13
Transcoder Adapters for Reasoning-Model Diffing trained adapters and feature data for https://arxiv.org/abs/2602.20904 (https://transcoder-adapters.github.io) Collection by nathu0 Feb 25 2 nathu0/transcoder-adapters-R1-Distill-Qwen-7B-l1w0.0001-l0-0.1 9B • Updated Feb 14 • 2 nathu0/transcoder-adapters-openthoughts3-stratified-55k Viewer • Updated Feb 13 • 54.9k • 15 • 1 nathu0/transcoder-adapters-R1-Distill-Qwen-7B-l1w0.01-l0-10.3 9B • Updated Feb 14 • 2 nathu0/transcoder-adapters-R1-Distill-Qwen-7B-l1w0.003-l0-4.3 9B • Updated Feb 14 • 1
GLM-5 Collection by zai-org Feb 11 37 zai-org/GLM-5 Text Generation • 754B • Updated Apr 5 • 65.6k • • 2.1k zai-org/GLM-5-FP8 Text Generation • 754B • Updated Apr 5 • 1.88M • 181
IAPO Checkpoints of paper "IAPO: Information-Aware Policy Optimization for Token-Efficient Reasoning" Collection by jonathanhe123 Feb 22 2
GUI-Libra Training GUI agents with augmented reasoning data and a tailored post-training recipe Collection by Ray2333 Apr 25 1 Ray2333/GUI-Libra-3B 4B • Updated Mar 1 • 3 Ray2333/Libra-81K-SFT Updated Mar 31 • 93 • 1 Ray2333/Offline_Evaluation Viewer • Updated Feb 22 • 35.2k • 17 Ray2333/Libra-81K Viewer • Updated Feb 20 • 738 • 33 • 1
ColBERT-Zero 🐶 First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT Collection by lightonai 22 days ago 23 ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published Feb 18 • 9 lightonai/ColBERT-Zero Sentence Similarity • 0.1B • Updated Feb 23 • 2.86k • • 41 lightonai/ColBERT-Zero-supervised Sentence Similarity • 0.1B • Updated Feb 23 • 60 • 3 lightonai/ColBERT-Zero-unsupervised Sentence Similarity • 0.1B • Updated Feb 23 • 196 • 2
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published Feb 18 • 9
Opensource datasets Collection by hivetrace Mar 11 3 hivetrace/prompt-2-prompt-injection-v2-dataset-ru Viewer • Updated Feb 11 • 22.1k • 271 • 3
Coding Agents Collection by openenv Feb 20 1 Running RL 14 Coding Environment Server 💻 14 Run Python code step-by-step in a coding environment Sleeping RL TB2 Environment Server 🧪 Interact with an OpenEnv agentic environment Runtime error Julia_env Environment Server 🐳 Infatoshi/kernrl-training Reinforcement Learning • Updated Jan 20
Mobile-O-Models This collection contains all models of Mobile-O project Collection by Amshaker Feb 20 3 Amshaker/Mobile-O-0.5B Text-to-Image • 2B • Updated Feb 24 • 343 • 11 Amshaker/Mobile-O-1.5B Text-to-Image • 4B • Updated Feb 24 • 181 • 11 Amshaker/Mobile-O-0.5B-iOS Image-Text-to-Text • Updated 23 days ago • 220 • 13
Mobile-O-Datasets This collection includes the pre-training, sft, and post-training data of Mobile-O Collection by Amshaker Feb 14 4 Amshaker/Mobile-O-Pre-Train Viewer • Updated Feb 24 • 22.8M • 4.83k • 11 Amshaker/Mobile-O-SFT Viewer • Updated Feb 24 • 7.11k • 265 • 5 Amshaker/Mobile-O-Post-Train Viewer • Updated Feb 24 • 7k • 455 • 13
Transcoder Adapters for Reasoning-Model Diffing trained adapters and feature data for https://arxiv.org/abs/2602.20904 (https://transcoder-adapters.github.io) Collection by nathu0 Feb 25 2 nathu0/transcoder-adapters-R1-Distill-Qwen-7B-l1w0.0001-l0-0.1 9B • Updated Feb 14 • 2 nathu0/transcoder-adapters-openthoughts3-stratified-55k Viewer • Updated Feb 13 • 54.9k • 15 • 1 nathu0/transcoder-adapters-R1-Distill-Qwen-7B-l1w0.01-l0-10.3 9B • Updated Feb 14 • 2 nathu0/transcoder-adapters-R1-Distill-Qwen-7B-l1w0.003-l0-4.3 9B • Updated Feb 14 • 1
coding Collection by darxtrix Feb 11 2 SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published Feb 2 • 61 LongCodeZip: Compress Long Context for Code Language Models Paper • 2510.00446 • Published Oct 1, 2025 • 108 Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published Feb 10 • 201 Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Paper • 2601.11868 • Published Jan 17 • 37
SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published Feb 2 • 61
LongCodeZip: Compress Long Context for Code Language Models Paper • 2510.00446 • Published Oct 1, 2025 • 108
Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published Feb 10 • 201
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Paper • 2601.11868 • Published Jan 17 • 37
GLM-5 Collection by zai-org Feb 11 37 zai-org/GLM-5 Text Generation • 754B • Updated Apr 5 • 65.6k • • 2.1k zai-org/GLM-5-FP8 Text Generation • 754B • Updated Apr 5 • 1.88M • 181