Collections
Discover the best community collections!
Collections trending this week
- Sparse Backpropagation for MoE Training
  Paper • 2310.00811 • Published • 2
- The Forward-Forward Algorithm: Some Preliminary Investigations
  Paper • 2212.13345 • Published • 5
- Fine-Tuning Language Models with Just Forward Passes
  Paper • 2305.17333 • Published • 4
- Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation
  Paper • 2309.13192 • Published • 1

- Concept-Oriented Deep Learning with Large Language Models
  Paper • 2306.17089 • Published • 1
- Extracting Mathematical Concepts with Large Language Models
  Paper • 2309.00642 • Published • 1
- An Image is Worth Multiple Words: Learning Object Level Concepts using Multi-Concept Prompt Learning
  Paper • 2310.12274 • Published • 13
- COPEN: Probing Conceptual Knowledge in Pre-trained Language Models
  Paper • 2211.04079 • Published • 1

- SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
  Paper • 2210.17432 • Published • 2
- TESS: Text-to-Text Self-Conditioned Simplex Diffusion
  Paper • 2305.08379 • Published • 3
- Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
  Paper • 2308.12219 • Published • 1
- CodeFusion: A Pre-trained Diffusion Model for Code Generation
  Paper • 2310.17680 • Published • 74

- A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies
  Paper • 2302.06218 • Published • 1
- ZeRO++: Extremely Efficient Collective Communication for Giant Model Training
  Paper • 2306.10209 • Published • 2
- SE-MoE: A Scalable and Efficient Mixture-of-Experts Distributed Training and Inference System
  Paper • 2205.10034 • Published • 1
- A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training
  Paper • 2303.06318 • Published • 1