AI & ML interests

A Family of Dynamic UltraFast Small Language Models Ready for Embodied Artificial General Intelligence!

Recent Activity

KingNishย 
posted an update 9 days ago
view post
Post
4215
We trained an open-source Mythos like cybersecurity LLM for the Build Small Hackathon meet OpenMythos

Trained in two stages: SFT on ~1.84K filtered ArXiv cs.CR papers + real CVE data, then RLVR using paired with past vulnerabilities GitHub repos with a verifier model checking outputs against ground truth.

Trained on: H100s from Modal

The RLVR stage made the biggest difference responses got more precise and less prone to confusing similar vulnerability classes.

Everything is open:
๐Ÿค– Demo โ†’ build-small-hackathon/OpenMythos
๐Ÿง  Model โ†’ build-small-hackathon/OpenMythos
๐Ÿ“ฆ CVE Dataset โ†’ build-small-hackathon/CVE_Vulnerailities_Detailed
๐Ÿ“„ ArXiv Dataset โ†’ himanshu17HF/ArvixImport-Filtered-Final

Try it out and let us know where it breaks ๐Ÿ™
Abhaykoulย 
posted an update 9 days ago
view post
Post
198
Shipped v0.1.2 of vtx โ€” a minimalist coding agent for the terminal.

Most agentic CLIs ship 10k+ token system prompts. Vtx is ~2,200. Less prompt overhead means more room for your code in the model's context window.

Vtx is a from-scratch Python implementation of the design philosophy behind pi-mono โ€” same principles, pure Python, no transpiled runtime.

What ships out of the box:

โ†’ Textual TUI + headless CLI (vtx -p "fix the failing test")
โ†’ 49 LLM provider gateways, all declared in a single provider.yaml
โ†’ 5 core tools (read / edit / write / bash / find) plus web search and fetch
โ†’ Session tree with compaction, handoff, and resume
โ†’ AGENTS.md / CLAUDE.md auto-discovery
โ†’ Skills system โ€” drop SKILL.md files in .agents/skills/ and they become slash commands
โ†’ Two OAuth flows (GitHub Copilot device flow, OpenAI Codex PKCE)
โ†’ Two-mode permissions: prompt (default) or auto, with a safe-command allowlist

This release adds a proper extension system. Register new LLM-callable tools, intercept tool calls, hook lifecycle events, and add slash commands from a single register(api) function in a Python file under ~/.vtx/agent/extensions/. Extensions can override built-in tools by name and chain handler logic across subscribers.

Apache 2.0. uv tool install vtx-coding-agent and you're running.

GitHub: https://github.com/OEvortex/vtx-coding-agent
PyPI: https://pypi.org/project/vtx-coding-agent

Built in the open. Feedback, extensions, and PRs welcome.
prithivMLmodsย 
posted an update 10 days ago
view post
Post
3926
Wan2.2-I2V-Fast with highly upscaled sequential frame sampling is now available as a Spaces demo, built using Wan2.2-I2V and FLUX.2-Klein. Try the demo using the links below.๐Ÿ‘‡

โž  wan2.2-i2v-fast : prithivMLmods/wan2.2-i2v-fast
โž  github: https://github.com/prithivsakthiur/wan2.2-i2v-fast
โž  collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

โคท To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update 25 days ago
prithivMLmodsย 
posted an update 28 days ago
view post
Post
6187
PiD โ€” Pixel Diffusion Decoder Image Edit Upscale and Image Generation Upscale, an all-in-one demo, is now live on Spaces! Great improvements in realism-based image generation and editing are powered by FLUX.2-Klein, while image generation is paired with Z-Image, and upscaling is enabled by default!

๐Ÿค— Space: prithivMLmods/PiD-Image-Upscaler
๐Ÿ”— Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

๐Ÿค— > To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update about 1 month ago
view post
Post
5595
I've made 8 Spaces in the Qwen-Image-Edit series, and out of them, 5 Spaces reached โ€œSpace of the Weekโ€! A few Spaces are still topping the list even after many months.

Cumulatively, the series has crossed 8.2 million+ ZeroGPU runs and nearly 4 million visitors overall.

Thanks for all the community support! ๐Ÿค—โค๏ธ

๐Ÿ”— Spaces: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection
  • 4 replies
ยท
Shrijanagainย 
posted an update about 1 month ago
view post
Post
2609
We are pleased to announce that the W-IMG Vision Dataset infrastructure is officially live.

The complete asset infrastructure is now accessible on Hugging Face for internal validation and architecture scaling targets.

Dataset Endpoint - sKT-Ai-Labs/W-IMG

#SovereignAI #ComputerVision #MachineLearning #OpenSource
prithivMLmodsย 
posted an update about 2 months ago
view post
Post
5945
Multimodal-Edge Demo, a node-based inference canvas demo, is now live on Spaces. It features node-based Transformers for fast inference across 10+ edge-device multimodal models on the Hub, all within a single space. The series includes models from Qwen3.5, Qwen3-VL, Gemma 4, and the LFM 2.5 VL model series, with support for reasoning and grounding tasks.

๐Ÿค— Demo: prithivMLmods/Multimodal-Edge-Node
๐Ÿ”— GitHub: https://github.com/PRITHIVSAKTHIUR/Multimodal-Edge-Node
โœ… Multimodal Apps Collections: https://huggingface.co/collections/prithivMLmods/hall-of-multimodal-apps

๐Ÿค— > To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update 2 months ago
view post
Post
1931
Now, a collection of various compression schemes for Qwen3.6 and the abliterated version 1 of dense models is available on the Hub. Check it out via the links below. ๐Ÿ‘‡

๐Ÿ”— Qwen3.6-MoE: https://huggingface.co/collections/prithivMLmods/qwen36-35b-a3b-compressions
๐Ÿ”— Qwen3.6-27B Compressions: https://huggingface.co/collections/prithivMLmods/qwen36-27b-compressions

๐Ÿค— > To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update 2 months ago
view post
Post
4229
HY-World-2.0 โ€” A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds is now available on Spaces, and it works both as native Gradio components and in Gradio server mode.

> HY-World-2.0-Demo: prithivMLmods/HY-World-2.0-Demo
> HY-World-2.0 [Server Mode]: prithivMLmods/HY-World-2.0-Demo
> Featuring 3D reconstruction and Gaussian splats with the Rerun viewer, along with camera poses, depth maps, and surface normals.
> In Server Mode, Gradio is served via FastAPI, with FastAPI remaining the top-level server.
> Model: tencent/HY-World-2.0
> GitHub: https://github.com/PRITHIVSAKTHIUR/HY-World-2.0-Demo

๐Ÿค—To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update 2 months ago
view post
Post
6249
A new comparator on Spaces showcases Standard FLUX.2 Decoder vs. FLUX.2 Small Decoder. The Small Decoder is ~1.4ร— faster, uses ~1.4ร— less VRAM, and maintains near-identical image quality. It has ~28M parameters with narrower channels [96, 192, 384, 384] vs. [128, 256, 512, 512], and the demo supports sequence generation by running both decoders simultaneously and comparing the results side by side.

๐Ÿค— Comparator: https://huggingface.co/spaces/prithivMLmods/Flux.2-4B-Decoder-Comparator
๐Ÿ”— FLUX.2-small-decoder: black-forest-labs/FLUX.2-small-decoder
๐Ÿ”— GitHub: https://github.com/PRITHIVSAKTHIUR/Flux.2-4B-Encoder-Comparator
๐Ÿš Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

๐Ÿค— > App built on the Gradio SDK. To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update 2 months ago
view post
Post
4264
Now, a collection of various compression schemes for Gemma 4 and the abliterated version 1 of dense models is available on the Hub. Check it out via the links below. ๐Ÿ‘‡

๐Ÿ”—Gemma 4 Compression(s)- https://huggingface.co/collections/prithivMLmods/gemma-4-compressions
๐Ÿ”—Gemma 4 Uncensored [MAX] + Compression(s) - [`ฮฒ ]- https://huggingface.co/collections/prithivMLmods/gemma-4-uncensored-max-compressions
๐Ÿ”—Gemma 4 Compression(s) - MoE- https://huggingface.co/collections/prithivMLmods/gemma-4-compressions-moe
๐Ÿ”—Gemma-4 F32 GGUF- https://huggingface.co/collections/prithivMLmods/gemma-4-f32-gguf

๐Ÿค— > To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update 3 months ago
view post
Post
2364
Now the demo for image detection based on SAM3 and Gemma-4 (*Filter) is available on Spaces, using full-fledged Transformers inference with multimodal reasoning for processed images. It also supports video segmentation (mask), video segmentation (annotation), and image click segmentation.

๐Ÿค— Demo Space: prithivMLmods/SAM3-Gemma4-CUDA
๐Ÿฅฝ SAM3: facebook/sam3
๐Ÿ”— gemma-4-E2B-it: google/gemma-4-E2B-it

To learn more, visit the app page or the respective model pages.
  • 1 reply
ยท
prithivMLmodsย 
posted an update 3 months ago
view post
Post
4790
The demo for Image Detection (*Filter) based on SAM3 and Qwen-3.5 is now available on Hugging Face Spaces using Transformers inference, with multimodal reasoning for processed images, and it also supports video segmentation (mask), video segmentation (annotation), and image click segmentation.

๐Ÿค— Demo Space: prithivMLmods/SAM3-Plus-Qwen3.5
๐Ÿฅฝ SAM3: facebook/sam3
๐Ÿ”— Qwen-3.5: Qwen/Qwen3.5-2B

To learn more, visit the app page or the respective model pages.
  • 5 replies
ยท