Collections
Discover the best community collections!
Collections trending this week
-
Stable Diffusion Webui
16 • Generate images from text prompts
-
Stable Diffusion 3 Medium Superpompt
42 • Stable Diffusion 3 Medium with SuperPrompt-v1 Enhancement!
-
IllusionDiffusion
5.37k • Generate stunning high-quality illusion artwork
-
Multi View Diffusion
64 • Generate multi-view images from text or images
-
apple/coreml-depth-anything-v2-small
Depth Estimation • Updated • 519 • 90
-
apple/coreml-depth-anything-small
Depth Estimation • Updated • 234 • 40
-
apple/coreml-detr-semantic-segmentation
Image Segmentation • Updated • 463 • 32
-
apple/coreml-FastViT-T8
Image Classification • Updated • 62 • 16
-
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Paper • 2406.11768 • Published • 24
-
Investigating Decoder-only Large Language Models for Speech-to-text Translation
Paper • 2407.03169 • Published • 11
-
PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation
Paper • 2407.02869 • Published • 21
-
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs
Paper • 2407.04051 • Published • 40
-
🧩 DiffuseCraft Mod (SDXL/SD1.5 Models Text-to-Image)
🧩 197 • Stunning images using Stable Diffusion.
-
Votepurchase Multiple Model (SD1.5/SDXL Text-to-Image)
136 • Text-to-Image
-
FLUX LoRA the Explorer Mod
152 • Generate images from text prompts and images
-
🧩 DiffuseCraft
🧩 213 • Stunning images using Stable Diffusion.