AI & ML interests

None defined yet.

KingNishย 
posted an update 13 days ago
view post
Post
4304
We trained an open-source Mythos like cybersecurity LLM for the Build Small Hackathon meet OpenMythos

Trained in two stages: SFT on ~1.84K filtered ArXiv cs.CR papers + real CVE data, then RLVR using paired with past vulnerabilities GitHub repos with a verifier model checking outputs against ground truth.

Trained on: H100s from Modal

The RLVR stage made the biggest difference responses got more precise and less prone to confusing similar vulnerability classes.

Everything is open:
๐Ÿค– Demo โ†’ build-small-hackathon/OpenMythos
๐Ÿง  Model โ†’ build-small-hackathon/OpenMythos
๐Ÿ“ฆ CVE Dataset โ†’ build-small-hackathon/CVE_Vulnerailities_Detailed
๐Ÿ“„ ArXiv Dataset โ†’ himanshu17HF/ArvixImport-Filtered-Final

Try it out and let us know where it breaks ๐Ÿ™
  • 2 replies
ยท
KingNishย 
posted an update 7 months ago
view post
Post
3817
Muon vs MuonClip vs Muon+Adamw

Muon has gone from an experiment to a mainstream optimizer, but does it hold up for fineโ€‘tuning? We ran headโ€‘toโ€‘head tests on Qwen3โ€‘4B (10k+ highโ€‘quality instruction rows) to find out.

Short story: Pure Muon converged fastest at the start, but its gradientโ€‘norm spikes made training unstable. MuonClip (Kimi K2โ€™s clipping) stabilizes long pretraining runs, yet in our smallโ€‘scale fineโ€‘tune it underperformed, lower token accuracy and slower convergence. The winner was the hybrid: Muon for 2D layers + AdamW for 1D layers. It delivered the best balance of stability and final performance and even beat vanilla AdamW.

Takeaway: for small-scale fine-tuning, hybrid = practical and reliable.

Next Step: scale to larger models/datasets to see if Muonโ€™s spikes become catastrophic or if clipping wins out.

Full Blog Link: https://huggingface.co/blog/KingNish/optimizer-part1
KingNishย 
posted an update 7 months ago
KingNishย 
posted an update 11 months ago
KingNishย 
posted an update about 1 year ago
view post
Post
1237
What's currently the biggest gap in Open Source Datasets ??
  • 5 replies
ยท
KingNishย 
posted an update over 1 year ago
KingNishย 
posted an update almost 2 years ago
view post
Post
8336
Exciting news! Introducing super-fast AI video assistant, currently in beta. With a minimum latency of under 500ms and an average latency of just 600ms.

DEMO LINK:
KingNish/Live-Video-Chat
  • 1 reply
ยท
KingNishย 
posted an update almost 2 years ago
KingNishย 
posted an update almost 2 years ago
view post
Post
3649
Mistral Nemo is better than many models in 1st grader level reasoning.
KingNishย 
posted an update almost 2 years ago
view post
Post
3980
I am experimenting with Flux and trying to push it to its limits without training (as I am GPU-poor ๐Ÿ˜…).
I found some flaws in the pipelines, which I resolved, and now I am able to generate an approx similar quality image as Flux Schnell 4 steps in just 1 step.
Demo Link:
KingNish/Realtime-FLUX

  • 1 reply
ยท
KingNishย 
posted an update almost 2 years ago
view post
Post
1986
I am excited to announce a major speed updated in Voicee, a superfast voice assistant.

It has now achieved latency <250 ms.
While its average latency is about 500ms.
KingNish/Voicee

This become Possible due to newly launched @sambanovasystems cloud.

You can also use your own API Key to get fastest speed.
You can get on from here: https://cloud.sambanova.ai/apis

For optimal performance use Google Chrome.

Please try Voicee and share your valuable feedback to help me further improve its performance and usability.
Thank you!
KingNishย 
posted an update almost 2 years ago
view post
Post
3638
Introducing Voicee, A superfast voice fast assistant.
KingNish/Voicee
It achieved latency <500 ms.
While its average latency is 700ms.
It works best in Google Chrome.
Please try and give your feedbacks.
Thank you. ๐Ÿค—
  • 3 replies
ยท
KingNishย 
posted an update almost 2 years ago
view post
Post
5935
Introducing OpenCHAT mini: a lightweight, fast, and unlimited version of OpenGPT 4o.

KingNish/OpenCHAT-mini2

It has unlimited web search, vision and image generation.

Please take a look and share your review. Thank you! ๐Ÿค—
  • 7 replies
ยท
KingNishย 
posted an update about 2 years ago
view post
Post
15178
OpenGPT 4o now features WEB SEARCH

This feature enhances the capabilities of OpenGPT 4o, allowing it to fetch and integrate the latest information from the web directly into its responses.
Try Now: KingNish/OpenGPT-4o

With WEB SEARCH, OpenGPT 4o becomes an even more versatile and dynamic AI, ready to assist with up-to-date data retrieval and analysis.
  • 30 replies
ยท
KingNishย 
posted an update about 2 years ago
view post
Post
6509
I am pleased to announce 2 amazing AI demos:

1. Chat with Google Agent - This includes three AI models that allow you to converse with an AI, which provides answers by searching Google.
Demo Link: poscye/google-go

2. HelpingAI 9B - A model that surpassed all top AIs with the highest EQ benchmark score of 89.23. It specializes in understanding human emotions and responding in human style.
Demo Link: https://huggingface.co/spaces/Abhaykoul/HelpingAI-9B
Model Link: https://huggingface.co/OEvortex/HelpingAI-9B
Blog Link: https://huggingface.co/blog/KingNish/helpingai-9b
  • 2 replies
ยท
KingNishย 
posted an update about 2 years ago
view post
Post
3800
ChatGPT made Custom GPTs Free for Everyone.

Yes, you can use them but...
with limitations like
You can't use DallE ๐Ÿ˜ฅ,
You can't make Custom GPTs
And chat limit also๐Ÿ˜ฅ.
But...
We already have an open-source alternative like Hugging Chat, where you can create your custom assistant, generate, edit images, without any chat limit.

Try both of them from here:
https://chatgpt.com/gpts
https://huggingface.co/chat

and don't forget to Give your review here ๐Ÿ‘‡:
  • 4 replies
ยท
KingNishย 
posted an update about 2 years ago
view post
Post
2721
Introducing Image Generator Pro
KingNish/Image-Gen-Pro

It is Expert in Text to Image generation, Sequential Image generation or Image Editing.

Examples:
  • 5 replies
ยท
KingNishย 
posted an update about 2 years ago
view post
Post
2973
OpenGPT 4o NEW UPDATES:
1. Dedicated Image and Video Engine
2. Model Choices for Voice Chat
3. Better and Faster Voice Chat
4. Various Bug fixes

Test and give feedback of New features:
KingNish/OpenGPT-4o

Future Updates:
1. Web Search (Suggested by @GPT007 and @Saionton )
2. Live Chat with Voice Chat
3. Model Choices (Suggested by @NotAiLOL )
4. Multilingual Chats.

Suggest more features that should be added. ๐Ÿค—
Thanks!
  • 6 replies
ยท
KingNishย 
posted an update about 2 years ago
view post
Post
4674
Microsoft Just Launched 3 Powerful Models

1. Phi 3 Medium (4k and 128k): A 14b Instruct tuned models that outperformed big models like Command R+ (104b), GPT 3.5 Pro, Gemini Pro, and is highly competitive with top models such as Mixtral 8x22b, Llama3 70B, and GPT 4.
microsoft/Phi-3-medium-4k-instruct
DEMO: https://huggingface.co/spaces/Walmart-the-bag/Phi-3-Medium

2. Phi 3 Mini Vision 128k: A 4.5 billion-parameter, instruction-tuned vision model that has outperformed models such as Llava3 and Claude 3, and is providing stiff competition to Gemini 1Pro Vision.
microsoft/Phi-3-vision-128k-instruct

3. Phi3 Small (8k and 128k): Better than Llama3 8b, Mixtral 8x7b and GPT 3.5 turbo.
microsoft/Phi-3-small-128k-instruct
  • 6 replies
ยท
KingNishย 
posted an update about 2 years ago
view post
Post
5086
Decoding GPT-4'o': Its Mechanisms and Creating Similar AI.

๐—ฅ๐—ฒ๐—ฎ๐—ฑ ๐—™๐˜‚๐—น๐—น ๐€๐ซ๐ญ๐ข๐œ๐ฅ๐ž: https://huggingface.co/blog/KingNish/decoding-gpt-4o

๐’๐ฎ๐ฆ๐ฆ๐š๐ซ๐ฒ ๐จ๐Ÿ ๐€๐ซ๐ญ๐ข๐œ๐ฅ๐ž- ๐Ÿ“
# ๐Œ๐ž๐œ๐ก๐š๐ง๐ข๐œ๐ฌ ๐จ๐Ÿ ๐†๐๐“-๐Ÿ’โ€™๐จโ€™: GPT-4โ€™oโ€™ operates through three main components ๐Ÿ› ๏ธ

๐Ÿ. ๐’๐ฎ๐ฉ๐ž๐ซ๐‚๐ก๐š๐ญ: Integrates image generation, QnA (image, document and video) for diverse interactions.
๐Ÿ. ๐•๐จ๐ข๐œ๐ž ๐‚๐ก๐š๐ญ: Merges TTS and STT for real-time, human-like audio responses, focusing on human interaction.
๐Ÿ‘. ๐•๐ข๐๐ž๐จ ๐‚๐ก๐š๐ญ: Utilizes Zero Shot Image Classification to enhance user interaction with visual information.

# ๐Œ๐ž๐ญ๐ก๐จ๐๐ฌ ๐ญ๐จ ๐‚๐ซ๐ž๐š๐ญ๐ž ๐’๐ข๐ฆ๐ข๐ฅ๐š๐ซ ๐€๐ˆ ๐Ÿง 

๐Ÿ. ๐Œ๐ฎ๐ฅ๐ญ๐ข๐Œ๐จ๐๐š๐ฅ๐ข๐Ÿ๐ข๐œ๐š๐ญ๐ข๐จ๐ง: Combines multiple models for a powerful, multifunctional AI.
๐Ÿ. ๐ƒ๐ฎ๐œ๐ญ ๐“๐š๐ฉ๐ž ๐Œ๐ž๐ญ๐ก๐จ๐: Uses different models or APIs for specific tasks without additional training.

The article provides an in-depth exploration of GPT-4โ€™oโ€™, its functionalities, and methods to create similar AI models. It emphasizes the modelโ€™s language support and its innovative approach to human-AI interaction. ๐Ÿ’ก๐ŸŒ

(๐™‰๐™Š๐™๐™€: ๐™Ž๐™ช๐™ข๐™ข๐™–๐™ง๐™ฎ ๐™ž๐™จ ๐˜ผ๐™„ ๐™œ๐™š๐™ฃ๐™š๐™ง๐™–๐™ฉ๐™š๐™™) โœ…
  • 2 replies
ยท