Collections
Discover the best community collections!
Collections trending this week
- Experts Weights Averaging: A New General Training Scheme for Vision Transformers
  Paper • 2308.06093 • Published • 2
- Platypus: Quick, Cheap, and Powerful Refinement of LLMs
  Paper • 2308.07317 • Published • 25
- Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers
  Paper • 2211.11315 • Published • 1
- LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
  Paper • 2307.13269 • Published • 34

- S^{3}: Increasing GPU Utilization during Generative Inference for Higher Throughput
  Paper • 2306.06000 • Published • 1
- PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference
  Paper • 2405.12532 • Published
- SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget
  Paper • 2404.04793 • Published • 1
- MiniCache: KV Cache Compression in Depth Dimension for Large Language Models
  Paper • 2405.14366 • Published • 3

- Experts Weights Averaging: A New General Training Scheme for Vision Transformers
  Paper • 2308.06093 • Published • 2
- Weight Averaging Improves Knowledge Distillation under Domain Shift
  Paper • 2309.11446 • Published • 1
- SWAMP: Sparse Weight Averaging with Multiple Particles for Iterative Magnitude Pruning
  Paper • 2305.14852 • Published • 1
- Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging
  Paper • 2306.16788 • Published • 1

- S^{3}: Increasing GPU Utilization during Generative Inference for Higher Throughput
  Paper • 2306.06000 • Published • 1
- Fast Distributed Inference Serving for Large Language Models
  Paper • 2305.05920 • Published • 1
- Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline
  Paper • 2305.13144 • Published • 1
- Towards MoE Deployment: Mitigating Inefficiencies in Mixture-of-Expert (MoE) Inference
  Paper • 2303.06182 • Published • 1