
Collections

Discover the best community collections!

Collections trending this week

lllyasviel/ControlNet-v1-1

Updated Apr 25, 2023 • 4.09k
TencentARC/T2I-Adapter

Updated Aug 22, 2023 • 845

baichuan-inc/Baichuan2-13B-Chat

Text Generation • Updated Feb 26, 2024 • 9.3k • 432

stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 1.33M • 7.88k
QuixiAI/Wizard-Vicuna-30B-Uncensored

Text Generation • Updated May 20, 2024 • 380 • 164

Dataset-Pretrain

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models

Paper • 2309.09958 • Published Sep 18, 2023 • 20
TextBind: Multi-turn Interleaved Multimodal Instruction-following

Paper • 2309.08637 • Published Sep 14, 2023 • 7
AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Paper • 2309.16058 • Published Sep 27, 2023 • 56
Qwen Technical Report

Paper • 2309.16609 • Published Sep 28, 2023 • 39

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Paper • 2309.08600 • Published Sep 15, 2023 • 15
Learning to Skip the Middle Layers of Transformers

Paper • 2506.21103 • Published Jun 26, 2025 • 18
Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Paper • 2602.06422 • Published Feb 6 • 47

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

Paper • 2401.08417 • Published Jan 16, 2024 • 37
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs

Paper • 2406.02886 • Published Jun 5, 2024 • 10


