lorafrenzi

community
Activity Feed

AI & ML interests

None defined yet.

ovi054 
posted an update 3 months ago
ovi054 
posted an update 3 months ago
view post
Post
2267
My project, Anim-Lab-AI, won the Community Choice Award at the MCP-1st-Birthday hackathon by @HuggingFace and @Gradio ! 🏆

It turns any idea or complex concept into a clear, engaging explainer animation video. 🎥

I want to thank everyone in the Hugging Face community for supporting my project!

MCP-1st-Birthday/anim-lab-ai
  • 2 replies
·
ovi054 
posted an update 4 months ago
view post
Post
2705
Z-Image Turbo + LoRA ⚡

ovi054/Z-Image-LORA

Z-Image Turbo is the No. 1 trending Text-to-Image model right now. You can add a custom LoRA and generate images with this Space.

👉 Try it now: ovi054/Z-Image-LORA
  • 3 replies
·
ovi054 
posted an update 5 months ago
view post
Post
3402
Anim Lab AI⚡

Turn any math concept or logic into a clear video explanation instantly using AI.

This is my submission for the MCP 1st Birthday Hackathon, and it’s already crossed 1,000 runs.

👉 Try it now: MCP-1st-Birthday/anim-lab-ai

Demo outputs are attached 👇
ovi054 
posted an update 5 months ago
view post
Post
6161
Introducing Anim Lab AI⚡

My submission for the MCP 1st Birthday Hackathon

Turn any math concept or logic into a clear video explanation instantly using AI.

👉 Try it now: MCP-1st-Birthday/anim-lab-ai

Demo outputs are attached 👇
ovi054 
posted an update 7 months ago
ovi054 
posted an update 8 months ago
view post
Post
6008
Image-to-Prompt⚡

ovi054/image-to-prompt

Extract text prompt from image. And you can reuse the prompt to generate similar images!

Useful for prompt engineering, studying image-to-text alignment, making training datasets, or recreating similar outputs.

Powered by: Gradio, Florence 2

👉 Try it now: ovi054/image-to-prompt
  • 3 replies
·
ovi054 
posted an update 8 months ago
view post
Post
4544
Update on https://huggingface.co/spaces/ovi054/Qwen-Image-LORA

You can now load a Qwen LoRA in this space as follows:

1. Model ID:
flymy-ai/qwen-image-realism-lora

2. Model link:
https://huggingface.co/flymy-ai/qwen-image-realism-lora

3. Specific file link:
https://huggingface.co/flymy-ai/qwen-image-realism-lora/blob/main/flymy_realism.safetensors

4. Direct download link:
https://huggingface.co/flymy-ai/qwen-image-realism-lora/resolve/main/flymy_realism.safetensors

You can also use an external .safetensors download link (if Hugging Face doesn’t block it).

It is useful if a model repository contains multiple weights and you want to load a specific one.

👉 Try it now: https://huggingface.co/spaces/ovi054/Qwen-Image-LORA
ovi054 
posted an update 9 months ago
view post
Post
3792
WAN 2.2 Text to Image ⚡

ovi054/wan2-2-text-to-image

We all know that WAN 2.2 A14B is a video model. But It turns out this video model can also produce great image results with incredible prompt adherence! The image output is sharp, detailed, and sticks to the prompt better than most.

👉 Try it now: ovi054/wan2-2-text-to-image
  • 1 reply
·
ovi054 
posted an update 9 months ago
ovi054 
posted an update about 1 year ago
view post
Post
3749
Image-to-Vector ⚡

ovi054/image-to-vector

Transform Images into Professional Vector Graphics
Convert your raster images (JPG, PNG, WEBP) into high-quality vector graphics (SVG) with our easy-to-use tool! Perfect for designers, artists, and anyone needing vector conversions.

🎯 Key Features:

Convert to scalable SVG vector graphics
Real-time preview of your SVG output
Advanced customization options
Clean, user-friendly interface
Batch processing ready

🛠️ Advanced Controls:

Color/B&W mode selection
Speckle filtering
Color precision adjustment
Layer management
Curve fitting options

💫 Why Image-to-Vector?

No installation needed
Free to use
Professional-grade output
Simple yet powerful

🔧 Technical Details:

Built with Gradio
Powered by VTracer
Optimized SVG generation

👉 Try it now: ovi054/image-to-vector


#computervision #vectorgraphics #imageprocessing #svg #design #ai
MichaelBoll 
posted an update over 1 year ago
view post
Post
764
Gradio not scrollable on iOS
  • 2 replies
·
MichaelBoll 
posted an update over 1 year ago
multimodalart 
posted an update almost 2 years ago
multimodalart 
posted an update almost 2 years ago
view post
Post
28636
The first open Stable Diffusion 3-like architecture model is JUST out 💣 - but it is not SD3! 🤔

It is Tencent-Hunyuan/HunyuanDiT by Tencent, a 1.5B parameter DiT (diffusion transformer) text-to-image model 🖼️✨, trained with multi-lingual CLIP + multi-lingual T5 text-encoders for english 🤝 chinese understanding

Try it out by yourself here ▶️ https://huggingface.co/spaces/multimodalart/HunyuanDiT
(a bit too slow as the model is chunky and the research code isn't super optimized for inference speed yet)

In the paper they claim to be SOTA open source based on human preference evaluation!
multimodalart 
posted an update about 2 years ago
view post
Post
The Stable Diffusion 3 research paper broken down, including some overlooked details! 📝

Model
📏 2 base model variants mentioned: 2B and 8B sizes

📐 New architecture in all abstraction levels:
- 🔽 UNet; ⬆️ Multimodal Diffusion Transformer, bye cross attention 👋
- 🆕 Rectified flows for the diffusion process
- 🧩 Still a Latent Diffusion Model

📄 3 text-encoders: 2 CLIPs, one T5-XXL; plug-and-play: removing the larger one maintains competitiveness

🗃️ Dataset was deduplicated with SSCD which helped with memorization (no more details about the dataset tho)

Variants
🔁 A DPO fine-tuned model showed great improvement in prompt understanding and aesthetics
✏️ An Instruct Edit 2B model was trained, and learned how to do text-replacement

Results
✅ State of the art in automated evals for composition and prompt understanding
✅ Best win rate in human preference evaluation for prompt understanding, aesthetics and typography (missing some details on how many participants and the design of the experiment)

Paper: https://stabilityai-public-packages.s3.us-west-2.amazonaws.com/Stable+Diffusion+3+Paper.pdf
  • 3 replies
·
multimodalart 
posted an update about 2 years ago
multimodalart 
posted an update about 2 years ago