- FlashDecoding++: Faster Large Language Model Inference on GPUs
  Paper • 2311.01282 • Published • 37
- Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache
  Paper • 2401.02669 • Published • 17
- Speculative Streaming: Fast LLM Inference without Auxiliary Models
  Paper • 2402.11131 • Published • 42
Collections
Collections trending this week
- Idempotent Generative Network
  Paper • 2311.01462 • Published • 25
- Adaptive Shells for Efficient Neural Radiance Field Rendering
  Paper • 2311.10091 • Published • 19
- Generative Powers of Ten
  Paper • 2312.02149 • Published • 8
- DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
  Paper • 2312.04433 • Published • 10