AI & ML interests

None defined yet.

prithivMLmodsย 
posted an update 6 days ago
view post
Post
4699
The Qwen3.5 Multimodal Understanding Demo, powered by Qwen3.5-2B, is now available on HF Spaces! It is a lightweight model designed for fast image and video reasoning. Built with Gradio, the demo showcases Image QA, Video QA, object detection, and 2D point tracking, along with real-time token streaming.

๐Ÿค— Demo: prithivMLmods/Qwen-3.5-HF-Demo
โœ… Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
๐Ÿ”— Qwen3.5-2B: Qwen/Qwen3.5-2B

To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update 10 days ago
view post
Post
3954
QIE-Object-Remover-Bbox Demo removes objects and artifacts from selected regions using bounding box grounding. Built on Qwen-Image-Edit-2509 with Rapid Diffusers acceleration, it delivers fast 4-step inference via the QIE-2509 adapter. ๐Ÿค—๐Ÿ”ฅ

๐Ÿ”—Demo Space: prithivMLmods/QIE-Object-Remover-Bbox
๐Ÿ”—Qwen-Image-Edit-Rapid-AIO: prithivMLmods/Qwen-Image-Edit-Rapid-AIO-V4
๐Ÿ”—Adapter-(LoRA): prithivMLmods/QIE-2509-Object-Remover-Bbox

๐Ÿ”—Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-layout-bbox

To learn more, visit the app page or the respective model pages.
  • 1 reply
ยท
prithivMLmodsย 
posted an update 16 days ago
view post
Post
2504
FireRed-Image-Edit-1.0 (Rapid) Fast Experimental Demo is Out! ๐Ÿš€๐Ÿค—

Demo: prithivMLmods/FireRed-Image-Edit-1.0-Fast

-> Paired the EditPlusPipeline with the Diffusers-compatible transformer weights of Rapid AIO from Qwen-Image-Edit. (experimental)
-> This fusion delivers more accurate instruction following, higher image quality, and consistent visual coherence @ 4-step fast inference.
-> Better maintains text styles with high fidelity, along with high-quality old photo restoration, enhancement, and best-in-class virtual try-on.

prithivMLmodsย 
posted an update 21 days ago
prithivMLmodsย 
posted an update 25 days ago
view post
Post
2588
Dropping the Qwen3 VL Series of Unredacted MAX-VL models. These models have undergone multi-stage training to minimize refusal rates through continuous abliterated optimization. You can find the models in BF16, FP8-Dynamic, and GGUF formats at the links below.๐Ÿ”ฅ๐Ÿš€

Unredacted MAX - VL:
โžœ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX
โžœ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX
โžœ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX
โžœ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX

Unredacted MAX - VL [FP8]
โžœ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX-FP8
โžœ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX-FP8
โžœ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX-FP8
โžœ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX-FP8

Unredacted MAX - VL [GGUF]
โžœ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX-GGUF
โžœ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX-GGUF
โžœ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX-GGUF
โžœ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX-GGUF

Unredacted MAX - VL [Collection]
โžœ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl-fp8
โžœ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl
โžœ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl-gguf

To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update about 1 month ago
view post
Post
2999
Introducing FLUX.2-Klein-LoRA-Studio, a demo for image editing using specialized LoRA adapters built for the FLUX.2-Klein-Distilled model. It features an edit-style gallery for multi-style image editing, including de-light, face swap, mannequin, and more. Try the demo below.

๐Ÿค—Demo: prithivMLmods/FLUX.2-Klein-LoRA-Studio
๐Ÿค—Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection
๐Ÿค—GitHub: https://github.com/PRITHIVSAKTHIUR/FLUX.2-Klein-LoRA-Studio

To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update about 1 month ago
view post
Post
879
GLM OCR, a multimodal OCR model for complex document understanding, built on the GLM-V encoderโ€“decoder architecture. It delivers high accuracy and strong generalization with a blazing-fast inference pipeline. The demo is live . Try it now. ๐Ÿค—๐Ÿš€

โœจ Demo: prithivMLmods/GLM-OCR-Demo
โœจ Multimodal Implementations: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
โœจ GitHub: https://github.com/PRITHIVSAKTHIUR/GLM-OCR-Demo
prithivMLmodsย 
posted an update about 1 month ago
view post
Post
2188
Introducing the Qwen-Image-Edit-3D-Lighting-Control app, featuring 8ร— horizontal and 3ร— elevational lighting positions for precise 3D lighting control. It enables studio-level lighting using fast Qwen Image Edit fast inference, paired with Multi-Angle-Lighting adapters. ๐Ÿ”ฆ

๐Ÿ”ฅ Space: prithivMLmods/Qwen-Image-Edit-3D-Lighting-Control
โœ… Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection
๐Ÿ“‚ GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-3D-Lighting-Control
prithivMLmodsย 
posted an update about 1 month ago
view post
Post
3667
Daggr UI version of the Qwen3-TTS demo.๐Ÿ”ฅ
(custom voice, voice design, qwen3-asr and voice cloning) nodes.
No remote spaces used for API inference; all functions run in-app fn.
Powered by t4-m and built with daggr@0.5.2 and gradio@6.

๐Ÿ‘‰Demo: prithivMLmods/Qwen3-TTS-Daggr-UI
โญGithub: https://github.com/PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI
  • 1 reply
ยท
prithivMLmodsย 
posted an update about 2 months ago
view post
Post
2710
Qwen-Image-Edit-Object-Manipulator Space is now featured in Hugging Face Space of the Week. It enables object manipulation such as extracting objects, adding designs, and removing objects or designs from the red highlighted area using specialized adapters.

๐Ÿ”ฅDo enjoy the demo! ~ prithivMLmods/Qwen-Image-Edit-Object-Manipulator

Collections:
๐ŸงจAdapters-1: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-exps
๐ŸงจAdapters-2: https://huggingface.co/collections/prithivMLmods/qie-jan-23-26
๐ŸงจAdapters-3: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-object-manipulator

โญGithub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-Object-Manipulator

To learn more, visit the app page or the respective model pages.
  • 1 reply
ยท
prithivMLmodsย 
posted an update about 2 months ago
view post
Post
3058
Introducing QIE-2511-Zoom-Master for highlight-guided area zoom-in, enabling lossless zooming within a drawn square area, and QIE-2511-Object-Remover-v2 for precise object or highlight-guided area cleanup. These experimental adapters are trained based on QIE-2511. Find the adapters below.

๐Ÿ•น๏ธQIE-2511-Zoom-Master : prithivMLmods/QIE-2511-Zoom-Master
๐Ÿ•น๏ธQIE-2511-Object-Remover-v2: prithivMLmods/QIE-2511-Object-Remover-v2

๐Ÿค—Demo: prithivMLmods/Qwen-Image-Edit-Object-Manipulator

๐Ÿ“‚Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-exps

To learn more, visit the app page or the respective model pages.
  • 2 replies
ยท
telcomย 
posted an update about 2 months ago
view post
Post
165
How to tell to your Chat LLM to be more natural.
Add the below to it's personality
Prefer specific facts over vague importance. Do not inflate significance with phrases like โ€œplays a pivotal roleโ€ or โ€œmarks a turning point.โ€ Use numbers, dates, mechanisms, or measurable outcomes. Example: replace โ€œthe system changed logisticsโ€ with โ€œthe system reduced container dwell time from 6.2 to 4.1 days.โ€

Avoid promotional language. Keep a neutral tone. Do not use adjectives such as vibrant, groundbreaking, renowned, innovative, or powerful. Use plain wording.

Limit AI-typical vocabulary such as crucial, pivotal, intricate, tapestry, underscore, highlighting, emphasizing, showcasing, fostering, or enhance. Prefer simpler words.

Avoid generic commentary and vague attribution. Do not write โ€œthis reflects broader trends,โ€ โ€œexperts say,โ€ or โ€œresearchers suggestโ€ unless a named source is given.

Avoid formulaic structures such as โ€œnot only X but also Yโ€ or โ€œdespite its success it faces challenges.โ€ Use direct explanations.

Use lists sparingly. Prefer short paragraphs unless bullets improve clarity. Avoid triple-adjective patterns.

Prefer simple sentences like โ€œX is Yโ€ or โ€œthe system uses Z.โ€ Minimize formatting. Avoid emojis, decorative headings, and excessive bold.

Remove sentences that add no information. Avoid generic endings such as โ€œin conclusionโ€ or โ€œoverall.โ€ Use concrete examples, real actors, workflows, and technologies when possible. Write like technical documentation or a research summary, not marketing or blog prose.
ยท
telcomย 
posted an update about 2 months ago
view post
Post
1581
MAD-GRPO: https://huggingface.co/blog/telcom/mad-grpo
In R1-Zero-Like Training *, Dr.GRPO treats GRPOโ€™s by dropping std, but that often comes with a hidden side effect: length-weighted updates that can nudge model toward verbosity.
MAD-GRPO provides robust scale (MAD + epsilon) per-token normalization stability without verbosity bias.

*https://huggingface.co/papers/2503.20783

prithivMLmodsย 
posted an update 2 months ago
view post
Post
5589
LTX-2 Camera-Control LoRA demo with dolly-in/out and dolly-left/right is now available on Hugging Face, paired with ltx-2-19b-distilled-lora for fast inference. It also includes dynamic GPU duration adjustments for long video generations. Click the related Space links below.

๐Ÿค—Try it now on : prithivMLmods/LTX-2-LoRAs-Camera-Control-Dolly
โญGithub: https://github.com/PRITHIVSAKTHIUR/LTX-2-LoRAs-Camera-Control-Dolly
๐Ÿ•น๏ธCollection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

To learn more, visit the app page or the respective model pages.
  • 2 replies
ยท
telcomย 
posted an update 2 months ago
prithivMLmodsย 
posted an update 2 months ago
view post
Post
2483
Dropping Image Edit (Object Manipulator): Add or remove specified objects/designs, with flexible support for both single-image and multi-image modes.

๐Ÿค— Demo: prithivMLmods/Qwen-Image-Edit-Object-Manipulator

Qwen-Image-Edit-2511-Object-Remover is an adapter (LoRA) developed for Qwenโ€™s Qwen-Image-Edit-2511 image-to-image model. It is specifically designed for precise object removal from images.

โญ Model: prithivMLmods/Qwen-Image-Edit-2511-Object-Remover

Qwen-Image-Edit-2511-Object-Adder is an adapter (LoRA) developed for Qwenโ€™s Qwen-Image-Edit-2511 image-to-image model. It is specifically designed for precise object addition to images.

โญ Model: prithivMLmods/Qwen-Image-Edit-2511-Object-Adder

๐Ÿ•น๏ธ Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-object-manipulator
๐Ÿ•น๏ธ github: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-Object-Manipulator

To learn more, visit the app page or the respective model pages.
telcomย 
posted an update 2 months ago
view post
Post
235
if you are interested in HUB (https://saemi410.github.io/HUB/ I recommend the fork I have created with some updates to make it smooth in running a smoke test git@github.com:javadtaghia/HUB.git) and you want to run the UCE (https://unified.baulab.info), please check:
- Model weights for UCE here: telcom/uce_NSFW
- Model weights for ESD here: telcom/esd_NSFW
- datasets and more download materials from: telcom/HUB_reference_dataset

Please read the notes in the model card.
prithivMLmodsย 
posted an update 2 months ago
view post
Post
4237
Update: TRELLIS.2 (Text to 3D, Image to 3D) Gradio with Rerun Embedded demo with improved visualization of the 3D model previewer is now available on Hugging Face. Generate assets and view them in the 3D viewer, powered and streamlined with Microsoftโ€™s TRELLIS.2 and Tongyi-MAIโ€™s Z-Image-Turbo models.

๐Ÿค— TRELLIS.2 (Demo): prithivMLmods/TRELLIS.2-Text-to-3D
๐Ÿ•น๏ธ GitHub: https://github.com/PRITHIVSAKTHIUR/TRELLIS.2-Text-to-3D-RERUN
๐Ÿ•น๏ธ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations

To know more about it, visit the app page or the respective model page!
prithivMLmodsย 
posted an update 3 months ago
view post
Post
4285
Introducing the Qwen-Image-Edit-2511-LoRAs-Fast demo, featuring image property comparison and contrast, built on top of Gradio and the combined Rerun SDK. It supports single and multi-image edits with existing LoRAs that are lazily loaded. (Note: This is still an experimental Space for Qwen-Image-Edit-2511.)

โญ Space Demo: prithivMLmods/Qwen-Image-Edit-2511-LoRAs-Fast
โญ GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-2511-LoRAs-Fast-Multi-Image-Rerun
โญ Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

To know more about it, visit the app page or the respective model page!
  • 2 replies
ยท
telcomย 
posted an update 3 months ago
view post
Post
268
NVIDIAโ€™s Groq deal ... I think, inference efficiency is becoming the main driver of profitability, and NVIDIAโ€™s Groq deal is evidence the market is moving from โ€œwho can train biggestโ€ to โ€œwho can serve cheapest and fastest at scale.โ€ That points to a maturing phase of AI, not necessarily the end of a bubble, but definitely a correction in what โ€œwinsโ€ long-term.
What do you think?
  • 2 replies
ยท