lora-training-frenzi (lorafrenzi)

posted an update 3 months ago

Post

3507

ovi054/LTX-2-19b-Squish-LoRA ⚡

I trained a Squish LoRA for LTX-2. Upload an image and use the prompt "squish it" to generate a squish video.

Demo output videos are attached.

👉Try it now:
ovi054/LTX-2-19b-Squish-LoRA
ovi054/ltx-2-Audio-to-Video

ovi054

posted an update 3 months ago

Post

2267

My project, Anim-Lab-AI, won the Community Choice Award at the MCP-1st-Birthday hackathon by @HuggingFace and @Gradio! 🏆

It turns any idea or complex concept into a clear, engaging explainer animation video. 🎥

I want to thank everyone in the Hugging Face community for supporting my project!

MCP-1st-Birthday/anim-lab-ai

2 replies

ovi054

posted an update 4 months ago

Post

2705

Z-Image Turbo + LoRA ⚡

ovi054/Z-Image-LORA

Z-Image Turbo is the No. 1 trending Text-to-Image model right now. You can add a custom LoRA and generate images with this Space.

👉 Try it now: ovi054/Z-Image-LORA

3 replies

ovi054

posted an update 5 months ago

Post

3402

Anim Lab AI⚡

Turn any math concept or logic into a clear video explanation instantly using AI.

This is my submission for the MCP 1st Birthday Hackathon, and it’s already crossed 1,000 runs.

👉 Try it now: MCP-1st-Birthday/anim-lab-ai

Demo outputs are attached 👇

ovi054

posted an update 5 months ago

Post

6161

Introducing Anim Lab AI⚡

My submission for the MCP 1st Birthday Hackathon

Turn any math concept or logic into a clear video explanation instantly using AI.

👉 Try it now: MCP-1st-Birthday/anim-lab-ai

Demo outputs are attached 👇

ovi054

posted an update 7 months ago

Post

2455

Virtual Try-On LoRA + FLUX.1 Kontext [dev] ⚡

Model: ovi054/virtual-tryon-kontext-lora

Place the garment onto the model image as an overlay and the LoRA model will generate a realistic try-on result.

👉 Try it now: ovi054/virtual-tryon-kontext-lora

ovi054

posted an update 8 months ago

Post

6008

Image-to-Prompt⚡

ovi054/image-to-prompt

Extract a text prompt from an image, then reuse that prompt to generate similar images!

Useful for prompt engineering, studying image-to-text alignment, making training datasets, or recreating similar outputs.

Powered by: Gradio, Florence 2

👉 Try it now: ovi054/image-to-prompt

3 replies

ovi054

posted an update 8 months ago

Post

4544

Update on https://huggingface.co/spaces/ovi054/Qwen-Image-LORA ⚡

You can now load a Qwen LoRA in this space as follows:

1. Model ID:

flymy-ai/qwen-image-realism-lora

2. Model link:

https://huggingface.co/flymy-ai/qwen-image-realism-lora

3. Specific file link:

https://huggingface.co/flymy-ai/qwen-image-realism-lora/blob/main/flymy_realism.safetensors

4. Direct download link:

https://huggingface.co/flymy-ai/qwen-image-realism-lora/resolve/main/flymy_realism.safetensors

You can also use an external .safetensors download link (if Hugging Face doesn’t block it).

Pointing to a specific file is useful when a model repository contains multiple weights and you want to load a particular one.
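For anyone curious how the four input forms above could be told apart, here is a minimal sketch. It is not the Space's actual code: the function name `parse_lora_reference` and its return shape are hypothetical, and it assumes the revision (such as `main`) contains no slashes.

```python
from urllib.parse import urlparse

HF_HOST = "huggingface.co"


def parse_lora_reference(ref: str):
    """Return (repo_id, filename, external_url); unused fields are None.

    Hypothetical helper for illustration only.
    """
    ref = ref.strip()
    if not ref.startswith(("http://", "https://")):
        # Form 1: a plain model ID such as "user/repo".
        return ref, None, None

    parsed = urlparse(ref)
    if parsed.netloc != HF_HOST:
        # External direct-download link: pass it through unchanged.
        return None, None, ref

    parts = [p for p in parsed.path.split("/") if p]
    repo_id = "/".join(parts[:2])
    # Forms 3 and 4: /user/repo/(blob|resolve)/<revision>/<path to file>
    if len(parts) >= 5 and parts[2] in ("blob", "resolve"):
        return repo_id, "/".join(parts[4:]), None
    # Form 2: a link to the repository itself.
    return repo_id, None, None


if __name__ == "__main__":
    url = ("https://huggingface.co/flymy-ai/qwen-image-realism-lora"
           "/blob/main/flymy_realism.safetensors")
    print(parse_lora_reference(url))
```

With a `repo_id` and `filename` in hand, a diffusers pipeline could then load the weights with something like `pipe.load_lora_weights(repo_id, weight_name=filename)`; whether the Space does exactly this is an assumption.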

👉 Try it now: https://huggingface.co/spaces/ovi054/Qwen-Image-LORA

ovi054

posted an update 9 months ago

Post

3792

WAN 2.2 Text to Image ⚡

ovi054/wan2-2-text-to-image

We all know that WAN 2.2 A14B is a video model. But it turns out this video model can also produce great image results with incredible prompt adherence! The image output is sharp, detailed, and sticks to the prompt better than most.

👉 Try it now: ovi054/wan2-2-text-to-image

1 reply

ovi054

posted an update 9 months ago

Post

2531

Qwen Image + LoRA ⚡

https://huggingface.co/spaces/ovi054/Qwen-Image-LORA

Qwen Image is the No. 1 trending Text-to-Image model right now. You can add a custom LoRA and generate images with this Space.

👉 Try it now: https://huggingface.co/spaces/ovi054/Qwen-Image-LORA

9 replies

ovi054

posted an update about 1 year ago

Post

3749

Image-to-Vector ⚡

ovi054/image-to-vector

Transform Images into Professional Vector Graphics
Convert your raster images (JPG, PNG, WEBP) into high-quality vector graphics (SVG) with our easy-to-use tool! Perfect for designers, artists, and anyone needing vector conversions.

🎯 Key Features:

Convert to scalable SVG vector graphics
Real-time preview of your SVG output
Advanced customization options
Clean, user-friendly interface
Batch processing ready

🛠️ Advanced Controls:

Color/B&W mode selection
Speckle filtering
Color precision adjustment
Layer management
Curve fitting options

💫 Why Image-to-Vector?

No installation needed
Free to use
Professional-grade output
Simple yet powerful

🔧 Technical Details:

Built with Gradio
Powered by VTracer
Optimized SVG generation

👉 Try it now: ovi054/image-to-vector

#computervision #vectorgraphics #imageprocessing #svg #design #ai

MichaelBoll

posted an update over 1 year ago

Post

764

Gradio not scrollable on iOS

2 replies

MichaelBoll

posted an update over 1 year ago

Post

1072

@jbilcke-hf hi

multimodalart

posted an update almost 2 years ago

Post

35624

New feature 🔥
Image models and LoRAs now have little previews 🤏

If you don't know where to start to find them, I invite you to browse cool LoRAs in the profiles of some amazing fine-tuners: @artificialguybr, @alvdansen, @DoctorDiffusion, @e-n-v-y, @KappaNeuro, @ostris

3 replies

multimodalart

posted an update almost 2 years ago

Post

28636

The first open Stable Diffusion 3-like architecture model is JUST out 💣 - but it is not SD3! 🤔

It is Tencent-Hunyuan/HunyuanDiT by Tencent, a 1.5B parameter DiT (diffusion transformer) text-to-image model 🖼️✨, trained with multilingual CLIP + multilingual T5 text encoders for English 🤝 Chinese understanding

Try it out by yourself here ▶️ https://huggingface.co/spaces/multimodalart/HunyuanDiT
(a bit too slow as the model is chunky and the research code isn't super optimized for inference speed yet)

In the paper they claim to be SOTA open source based on human preference evaluation!

multimodalart

posted an update about 2 years ago

Post

The Stable Diffusion 3 research paper broken down, including some overlooked details! 📝

Model
📏 2 base model variants mentioned: 2B and 8B sizes

📐 New architecture in all abstraction levels:
- 🔽 UNet; ⬆️ Multimodal Diffusion Transformer, bye cross attention 👋
- 🆕 Rectified flows for the diffusion process
- 🧩 Still a Latent Diffusion Model

📄 3 text-encoders: 2 CLIPs, one T5-XXL; plug-and-play: removing the larger one maintains competitiveness

🗃️ Dataset was deduplicated with SSCD which helped with memorization (no more details about the dataset tho)

Variants
🔁 A DPO fine-tuned model showed great improvement in prompt understanding and aesthetics
✏️ An Instruct Edit 2B model was trained, and learned how to do text-replacement

Results
✅ State of the art in automated evals for composition and prompt understanding
✅ Best win rate in human preference evaluation for prompt understanding, aesthetics and typography (missing some details on how many participants and the design of the experiment)

Paper: https://stabilityai-public-packages.s3.us-west-2.amazonaws.com/Stable+Diffusion+3+Paper.pdf

3 replies

multimodalart

posted an update about 2 years ago

Post

⚔️ The TIGERLab's Text2Image arena is here! ⚔️
TIGER-Lab/GenAI-Arena

Like https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard for LLMs: you prompt, two images emerge, vote for the best one 🏆

With enough votes this will lead to an Elo-based leaderboard for text-to-image models, go vote! 🗳️
TIGER-Lab/GenAI-Arena
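
To make the Elo idea concrete, here is a rough sketch of how a single vote could move two models' ratings. The post does not say how the arena actually computes its leaderboard, so the standard Elo formula, the K-factor of 32, the 400-point scale, and the function names below are all assumptions for illustration.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that model A beats model B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))


def update_elo(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return the new (rating_a, rating_b) after one vote.

    score_a is 1.0 if A's image won, 0.0 if B's image won, 0.5 for a tie.
    """
    exp_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (score_a - exp_a)
    # B's actual and expected scores are the complements of A's.
    new_b = rating_b + k * ((1.0 - score_a) - (1.0 - exp_a))
    return new_a, new_b


if __name__ == "__main__":
    # Two hypothetical models start level; A wins the vote.
    print(update_elo(1000.0, 1000.0, 1.0))
```

An upset win against a higher-rated model moves the ratings more than an expected win, which is why the ranking settles down as votes accumulate.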

multimodalart

posted an update about 2 years ago

Post

It seems February started with a fully open source AI renaissance 🌟

Models released with fully open dataset, training code, weights ✅

LLM - allenai/olmo-suite-65aeaae8fe5b6b2122b46778 🧠
Embedding - nomic-ai/nomic-embed-text-v1 📚 (sota!)

And it's literally February 1st - can't wait to see what else the community will bring 👀

multimodalart

authored 2 papers over 2 years ago

LEDITS++: Limitless Image Editing using Text-to-Image Models

Paper • 2311.16711 • Published Nov 28, 2023 • 25

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Paper • 2311.05556 • Published Nov 9, 2023 • 86

Team members 843

lora-training-frenzi's activity