Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Collections

Discover the best community collections!

Collections trending this week

Retrofitting Recurrence

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 20
smcleish/Recurrent-Llama-3.2-train-recurrence-32

Text Generation • 1B • Updated Nov 11, 2025 • 125 • 1
smcleish/Recurrent-Llama-3.2-train-recurrence-16

Text Generation • 1B • Updated Nov 11, 2025 • 13
smcleish/Recurrent-Llama-3.2-train-recurrence-8

Text Generation • 1B • Updated Nov 11, 2025 • 52

Content moderation models and datasets - 2025

Models and datasets that support automatic content moderation

A Holistic Approach to Undesired Content Detection in the Real World

Paper • 2208.03274 • Published Aug 5, 2022
ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation

Paper • 2310.17389 • Published Oct 26, 2023
gpt-oss-safeguard

Collection

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated Oct 29, 2025 • 73
GPT-OSS-safeguard:20b

Collection

MLX Based GPT-OSS-Safeguard models • 5 items • Updated Nov 1, 2025

Paza is a collection of speech models & benchmarks for low resource languages by the Microsoft Research Africa - Nairobi Lab

Running

Agents

20

PazaBench

🥇

20

ASR Leaderboard for low resource languages
microsoft/paza-Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated Feb 4 • 157 • 4
microsoft/paza-whisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated Feb 4 • 458 • 8

about 17 hours ago

datalab-to/chandra

Image-Text-to-Text • 9B • Updated Mar 26 • 207k • 525
datalab-to/chandra-ocr-2

Image-Text-to-Text • 5B • Updated Mar 18 • 2M • 391
datalab-to/lift

Image-Text-to-Text • 10B • Updated about 22 hours ago • 68
datalab-to/surya-ocr-2

Image-Text-to-Text • 0.7B • Updated 24 days ago • 327k • 47

From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents

mlfoundations/Gelato-30B-A3B

Image-Text-to-Text • 31B • Updated Nov 15, 2025 • 74 • 33
mlfoundations/Click-100k

Viewer • Updated Nov 11, 2025 • 101k • 827 • 18
🍨 Gelato-30B-A3B Checkpoints

Collection

Intermediate checkpoints for Gelato-30B-A3B. Refer to https://github.com/mlfoundations/gelato for more details. • 29 items • Updated Oct 29, 2025 • 1
mlfoundations/gelato-osworld-agent-trajectories

Viewer • Updated Nov 6, 2025 • 13.5k • 313 • 2

Running

598

NSFW Image Generator

😻

598

Uncensored AI Image Generator
Running

Agents

325

Miragic AI Image Generator

🎨

325

Create stunning AI-generated images with text prompts
Runtime error

Agents

4

Nfsw Image

🖼

4

Generate images from text prompts
Runtime error

Agents

3

Nudes

🖼

3

Test

Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Paper • 2511.10645 • Published Nov 13, 2025 • 13
z-lab/Qwen3.6-27B-PARO

Image-Text-to-Text • 6B • Updated May 15 • 5.79k • 27
z-lab/Qwen3.6-35B-A3B-PARO

Image-Text-to-Text • 6B • Updated May 15 • 3.47k • 6
z-lab/gemma-4-31B-it-PARO

Image-Text-to-Text • 6B • Updated May 15 • 1.73k • 21

This is the official set of weights for the paper “Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions.”

lchen1019/LlavaQwen2-Align-TI-1B

Image-Text-to-Text • 1B • Updated Feb 11 • 5
lchen1019/LlavaQwen3-Align-TI-2B

2B • Updated Oct 29, 2025 • 3
lchen1019/LlavaQwen2-Align-TI-2B

2B • Updated Oct 29, 2025 • 4
lchen1019/LlavaQwen3-Align-TI-1B

1B • Updated Oct 29, 2025 • 4

Checkpoints, data and logs of MemoryVLA & MemoryVLA+. https://github.com/shihao1895/MemoryVLA

shihao1895/memvla-libero-spatial

Updated Nov 5, 2025 • 66
shihao1895/memvla-libero-object

Updated Nov 5, 2025 • 29
shihao1895/memvla-libero-goal

Updated Nov 5, 2025 • 34
shihao1895/memvla-libero-100

Updated Nov 5, 2025 • 180

Pretrained ARC-Encoders and a fine-tuning dataset: context compression for unmodified LLMs.

ARC-Encoder: learning compressed text representations for large language models

Paper • 2510.20535 • Published Oct 23, 2025 • 10
kyutai/ARC8_Encoder_Llama

Feature Extraction • Updated Nov 5, 2025 • 12 • 3
kyutai/ARC_finetuning

Preview • Updated Oct 24, 2025 • 61 • 1
kyutai/ARC8_Encoder_multi

Feature Extraction • Updated Nov 5, 2025 • 54 • 7

Retrofitting Recurrence

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 20
smcleish/Recurrent-Llama-3.2-train-recurrence-32

Text Generation • 1B • Updated Nov 11, 2025 • 125 • 1
smcleish/Recurrent-Llama-3.2-train-recurrence-16

Text Generation • 1B • Updated Nov 11, 2025 • 13
smcleish/Recurrent-Llama-3.2-train-recurrence-8

Text Generation • 1B • Updated Nov 11, 2025 • 52

Running

598

NSFW Image Generator

😻

598

Uncensored AI Image Generator
Running

Agents

325

Miragic AI Image Generator

🎨

325

Create stunning AI-generated images with text prompts
Runtime error

Agents

4

Nfsw Image

🖼

4

Generate images from text prompts
Runtime error

Agents

3

Nudes

🖼

3

Test

Content moderation models and datasets - 2025

Models and datasets that support automatic content moderation

A Holistic Approach to Undesired Content Detection in the Real World

Paper • 2208.03274 • Published Aug 5, 2022
ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation

Paper • 2310.17389 • Published Oct 26, 2023
gpt-oss-safeguard

Collection

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated Oct 29, 2025 • 73
GPT-OSS-safeguard:20b

Collection

MLX Based GPT-OSS-Safeguard models • 5 items • Updated Nov 1, 2025

Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Paper • 2511.10645 • Published Nov 13, 2025 • 13
z-lab/Qwen3.6-27B-PARO

Image-Text-to-Text • 6B • Updated May 15 • 5.79k • 27
z-lab/Qwen3.6-35B-A3B-PARO

Image-Text-to-Text • 6B • Updated May 15 • 3.47k • 6
z-lab/gemma-4-31B-it-PARO

Image-Text-to-Text • 6B • Updated May 15 • 1.73k • 21

Paza is a collection of speech models & benchmarks for low resource languages by the Microsoft Research Africa - Nairobi Lab

Running

Agents

20

PazaBench

🥇

20

ASR Leaderboard for low resource languages
microsoft/paza-Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated Feb 4 • 157 • 4
microsoft/paza-whisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated Feb 4 • 458 • 8

This is the official set of weights for the paper “Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions.”

lchen1019/LlavaQwen2-Align-TI-1B

Image-Text-to-Text • 1B • Updated Feb 11 • 5
lchen1019/LlavaQwen3-Align-TI-2B

2B • Updated Oct 29, 2025 • 3
lchen1019/LlavaQwen2-Align-TI-2B

2B • Updated Oct 29, 2025 • 4
lchen1019/LlavaQwen3-Align-TI-1B

1B • Updated Oct 29, 2025 • 4

about 17 hours ago

datalab-to/chandra

Image-Text-to-Text • 9B • Updated Mar 26 • 207k • 525
datalab-to/chandra-ocr-2

Image-Text-to-Text • 5B • Updated Mar 18 • 2M • 391
datalab-to/lift

Image-Text-to-Text • 10B • Updated about 22 hours ago • 68
datalab-to/surya-ocr-2

Image-Text-to-Text • 0.7B • Updated 24 days ago • 327k • 47

Checkpoints, data and logs of MemoryVLA & MemoryVLA+. https://github.com/shihao1895/MemoryVLA

shihao1895/memvla-libero-spatial

Updated Nov 5, 2025 • 66
shihao1895/memvla-libero-object

Updated Nov 5, 2025 • 29
shihao1895/memvla-libero-goal

Updated Nov 5, 2025 • 34
shihao1895/memvla-libero-100

Updated Nov 5, 2025 • 180

From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents

mlfoundations/Gelato-30B-A3B

Image-Text-to-Text • 31B • Updated Nov 15, 2025 • 74 • 33
mlfoundations/Click-100k

Viewer • Updated Nov 11, 2025 • 101k • 827 • 18
🍨 Gelato-30B-A3B Checkpoints

Collection

Intermediate checkpoints for Gelato-30B-A3B. Refer to https://github.com/mlfoundations/gelato for more details. • 29 items • Updated Oct 29, 2025 • 1
mlfoundations/gelato-osworld-agent-trajectories

Viewer • Updated Nov 6, 2025 • 13.5k • 313 • 2

Pretrained ARC-Encoders and a fine-tuning dataset: context compression for unmodified LLMs.

ARC-Encoder: learning compressed text representations for large language models

Paper • 2510.20535 • Published Oct 23, 2025 • 10
kyutai/ARC8_Encoder_Llama

Feature Extraction • Updated Nov 5, 2025 • 12 • 3
kyutai/ARC_finetuning

Preview • Updated Oct 24, 2025 • 61 • 1
kyutai/ARC8_Encoder_multi

Feature Extraction • Updated Nov 5, 2025 • 54 • 7

Previous
1
...
43
44
45
46
47
...
21,245
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs