-
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled
Image-Text-to-Text • 28B • Updated • 552k • 2.43k -
HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive
Image-Text-to-Text • 35B • Updated • 779k • 1.21k -
huihui-ai/Huihui-Qwen3.5-27B-Claude-4.6-Opus-abliterated
Image-Text-to-Text • 28B • Updated • 19.5k • 104 -
Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-GGUF
Text Generation • 4B • Updated • 274k • 91
Collections
Discover the best community collections!
Collections trending this week
-
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System
Paper • 2602.02488 • Published • 36 -
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Paper • 2405.19548 • Published • 1 -
Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic
Paper • 2601.21972 • Published • 1 -
SAFE: Stable Alignment Finetuning with Entropy-Aware Predictive Control for RLHF
Paper • 2602.04651 • Published • 1
-
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled
Image-Text-to-Text • 28B • Updated • 552k • 2.43k -
HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive
Image-Text-to-Text • 35B • Updated • 779k • 1.21k -
huihui-ai/Huihui-Qwen3.5-27B-Claude-4.6-Opus-abliterated
Image-Text-to-Text • 28B • Updated • 19.5k • 104 -
Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-GGUF
Text Generation • 4B • Updated • 274k • 91
-
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System
Paper • 2602.02488 • Published • 36 -
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Paper • 2405.19548 • Published • 1 -
Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic
Paper • 2601.21972 • Published • 1 -
SAFE: Stable Alignment Finetuning with Entropy-Aware Predictive Control for RLHF
Paper • 2602.04651 • Published • 1