Collections
Discover the best community collections!
Collections trending this week
-
DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF
Text Generation • 18B • Updated • 43.1k • 571 -
DavidAU/Gemma-The-Writer-N-Restless-Quill-10B-Uncensored-GGUF
Text Generation • 10B • Updated • 6.26k • 144 -
DavidAU/Llama-3.2-8X4B-MOE-V2-Dark-Champion-Instruct-uncensored-abliterated-21B-GGUF
Text Generation • 21B • Updated • 11.1k • 138 -
DavidAU/L3-Stheno-Maid-Blackroot-Grand-HORROR-16B-GGUF
Text Generation • 17B • Updated • 2.33k • 94
-
Direct-Preference-Optimization-Datasets-explorer
💻2Browse and view datasets from a Hugging Face collection
-
argilla/distilabel-capybara-dpo-7k-binarized
Viewer • Updated • 7.56k • 5.87k • 184 -
HuggingFaceH4/ultrafeedback_binarized
Viewer • Updated • 187k • 11.8k • 341 -
allenai/reward-bench
Viewer • Updated • 8.11k • 4.78k • 109
-
mlabonne/gemma-3-27b-it-abliterated
Image-Text-to-Text • 27B • Updated • 2.59k • • 333 -
mlabonne/gemma-3-27b-it-abliterated-GGUF
Image-Text-to-Text • 27B • Updated • 7.19k • 250 -
mlabonne/gemma-3-12b-it-abliterated-v2
Image-Text-to-Text • 12B • Updated • 490 • 21 -
mlabonne/gemma-3-4b-it-abliterated-v2
Image-Text-to-Text • 4B • Updated • 55 • 11
-
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning
Paper • 2407.20798 • Published • 24 -
Offline Reinforcement Learning for LLM Multi-Step Reasoning
Paper • 2412.16145 • Published • 38 -
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
Paper • 2501.03262 • Published • 104 -
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Paper • 2502.18449 • Published • 75
-
mlabonne/gemma-3-27b-it-abliterated
Image-Text-to-Text • 27B • Updated • 2.59k • • 333 -
mlabonne/gemma-3-27b-it-abliterated-GGUF
Image-Text-to-Text • 27B • Updated • 7.19k • 250 -
mlabonne/gemma-3-12b-it-abliterated-v2
Image-Text-to-Text • 12B • Updated • 490 • 21 -
mlabonne/gemma-3-4b-it-abliterated-v2
Image-Text-to-Text • 4B • Updated • 55 • 11
-
DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF
Text Generation • 18B • Updated • 43.1k • 571 -
DavidAU/Gemma-The-Writer-N-Restless-Quill-10B-Uncensored-GGUF
Text Generation • 10B • Updated • 6.26k • 144 -
DavidAU/Llama-3.2-8X4B-MOE-V2-Dark-Champion-Instruct-uncensored-abliterated-21B-GGUF
Text Generation • 21B • Updated • 11.1k • 138 -
DavidAU/L3-Stheno-Maid-Blackroot-Grand-HORROR-16B-GGUF
Text Generation • 17B • Updated • 2.33k • 94
-
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning
Paper • 2407.20798 • Published • 24 -
Offline Reinforcement Learning for LLM Multi-Step Reasoning
Paper • 2412.16145 • Published • 38 -
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
Paper • 2501.03262 • Published • 104 -
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Paper • 2502.18449 • Published • 75
-
Direct-Preference-Optimization-Datasets-explorer
💻2Browse and view datasets from a Hugging Face collection
-
argilla/distilabel-capybara-dpo-7k-binarized
Viewer • Updated • 7.56k • 5.87k • 184 -
HuggingFaceH4/ultrafeedback_binarized
Viewer • Updated • 187k • 11.8k • 341 -
allenai/reward-bench
Viewer • Updated • 8.11k • 4.78k • 109