AI & ML interests

None defined yet.

Recent Activity

Parveshiiii
posted an update 6 days ago
1545
🚀 Wanna train your own AI Model or Tokenizer from scratch?

Building models isn’t just for big labs anymore. With the right data, compute, and workflow, you can create **custom AI models** and **tokenizers** tailored to any domain. Whether it’s NLP, domain-specific datasets, or experimental architectures, training from scratch gives you full control over vocabulary, embeddings, and performance.

✨ Why train your own?
- Full control over vocabulary & tokenization
- Domain-specific optimization (medical, legal, technical, etc.)
- Better performance on niche datasets
- Freedom to experiment with architectures

⚡ The best part?
- Tokenizer training (TikToken / BPE) can be done in **just 3 lines of code**.
- Model training runs smoothly on **Google Colab notebooks**, with no expensive hardware required.
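
For readers who want to see what BPE tokenizer training does under the hood, here is a minimal from-scratch sketch in plain Python. It is not the code from the linked repositories, and it is longer than three lines because it uses no library. The function names (`train_bpe`, `merge`, `encode`, `decode`) and the sample text are made up for illustration.

```python
from collections import Counter


def merge(ids, pair, new_id):
    """Replace every non-overlapping occurrence of `pair` in `ids` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out


def train_bpe(text, num_merges):
    """Learn up to `num_merges` byte-pair merges from `text`."""
    ids = list(text.encode("utf-8"))  # start from raw bytes (ids 0..255)
    merges = {}  # (left_id, right_id) -> new_id, kept in learned order
    for step in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        pair, count = pairs.most_common(1)[0]
        if count < 2:
            break  # no pair repeats, so another merge would not compress anything
        new_id = 256 + step
        merges[pair] = new_id
        ids = merge(ids, pair, new_id)
    return merges


def encode(text, merges):
    """Tokenize `text` by applying the learned merges in the order they were learned."""
    ids = list(text.encode("utf-8"))
    for pair, new_id in merges.items():
        ids = merge(ids, pair, new_id)
    return ids


def decode(ids, merges):
    """Map token ids back to text by expanding each merged id into its bytes."""
    vocab = {i: bytes([i]) for i in range(256)}
    for (left, right), new_id in merges.items():
        vocab[new_id] = vocab[left] + vocab[right]
    return b"".join(vocab[i] for i in ids).decode("utf-8", errors="replace")


# Toy corpus, for illustration only.
merges = train_bpe("low lower lowest low low", num_merges=10)
tokens = encode("lowest low", merges)
print(tokens, "->", decode(tokens, merges))
```

A production tokenizer would add pre-tokenization rules, special tokens, and a much larger corpus; this sketch only shows the merge loop itself.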

📂 Try out my work:
- 🔗 https://github.com/OE-Void/Tokenizer-from_scratch
- 🔗 https://github.com/OE-Void/GPT
Parveshiiii
posted an update 11 days ago
216
📢 The Announcement
Subject: XenArcAI is now Modotte – A New Chapter Begins! 🚀

Hello everyone,

We are thrilled to announce that XenArcAI is officially rebranding to Modotte!

Since our journey began, we’ve been committed to pushing the boundaries of AI through open-source innovation, research, and high-quality datasets. As we continue to evolve, we wanted a name that better represents our vision for a modern, interconnected future in the tech space.

What is changing?

The Name: Moving forward, all our projects, models, and community interactions will happen under the Modotte banner.

The Look: You’ll see our new logo and a fresh color palette appearing across our platforms.

What is staying the same?

The Core Team: It’s still the same people behind the scenes, including our founder, Parvesh Rawal.

Our Mission: We remain dedicated to releasing state-of-the-art open-source models and datasets.

Our Continuity: All existing models, datasets, and projects will remain exactly as they are, just with a new home.

This isn’t just a change in appearance; it’s a commitment to our next chapter of growth and discovery. We are so grateful for your ongoing support as we step into this new era.

Welcome to the future. Welcome to Modotte.

Best regards, The Modotte Team
JDhruv14
in IndianAIDevs/README 25 days ago

Let's Talk about AI

91
#1 opened 6 months ago by
kalashshah19
Abhaykoul
in IndianAIDevs/README about 1 month ago

Let's Talk about AI

91
#1 opened 6 months ago by
kalashshah19
vasistha2k
in IndianAIDevs/README about 1 month ago

Let's Talk about AI

91
#1 opened 6 months ago by
kalashshah19
Jainam-11
in IndianAIDevs/README about 1 month ago

Let's Talk about AI

91
#1 opened 6 months ago by
kalashshah19
Parveshiiii
posted an update about 1 month ago
3572
Hey everyone!
We’re excited to introduce our new Telegram group: https://t.me/XenArcAI

This space is built for **model builders, tech enthusiasts, and developers** who want to learn, share, and grow together. Whether you’re just starting out or already deep into AI/ML, you’ll find a supportive community ready to help with knowledge, ideas, and collaboration.

💡 Join us to:
- Connect with fellow developers and AI enthusiasts
- Share your projects, insights, and questions
- Learn from others and contribute to a growing knowledge base

👉 If you’re interested, hop in and be part of the conversation: https://t.me/XenArcAI
kalashshah19
in IndianAIDevs/README about 1 month ago
KingNish
posted an update about 2 months ago
2684
Muon vs MuonClip vs Muon+AdamW

Muon has gone from an experiment to a mainstream optimizer, but does it hold up for fine-tuning? We ran head-to-head tests on Qwen3-4B (10k+ high-quality instruction rows) to find out.

Short story: Pure Muon converged fastest at the start, but its gradient-norm spikes made training unstable. MuonClip (Kimi K2’s clipping) stabilizes long pretraining runs, yet in our small-scale fine-tune it underperformed, with lower token accuracy and slower convergence. The winner was the hybrid: Muon for 2D layers + AdamW for 1D layers. It delivered the best balance of stability and final performance and even beat vanilla AdamW.

Takeaway: for small-scale fine-tuning, hybrid = practical and reliable.
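
The hybrid setup comes down to routing each parameter to one of two optimizers based on its shape. The sketch below shows only that routing step and is not taken from the linked blog. The function name `split_params_for_hybrid` and the parameter names and shapes are hypothetical, and the optimizers themselves are left out.

```python
def split_params_for_hybrid(shapes):
    """Route 2D weight matrices to Muon and all other parameters to AdamW.

    `shapes` maps a parameter name to its shape tuple.
    Returns (muon_names, adamw_names), each in the original order.
    """
    muon_names, adamw_names = [], []
    for name, shape in shapes.items():
        if len(shape) == 2:
            muon_names.append(name)   # matrices: attention and MLP projections
        else:
            adamw_names.append(name)  # vectors: norm gains, biases
    return muon_names, adamw_names


# Hypothetical parameter names and shapes, for illustration only.
shapes = {
    "layers.0.attn.q_proj.weight": (2560, 2560),
    "layers.0.mlp.up_proj.weight": (9728, 2560),
    "layers.0.input_layernorm.weight": (2560,),
    "layers.0.attn.q_proj.bias": (2560,),
}

muon_names, adamw_names = split_params_for_hybrid(shapes)
print("Muon: ", muon_names)
print("AdamW:", adamw_names)
```

Some Muon setups also keep the embedding and output-head matrices on AdamW even though they are 2D. The post does not say whether that was done here, so this sketch splits purely on dimensionality.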

Next Step: scale to larger models/datasets to see if Muon’s spikes become catastrophic or if clipping wins out.

Full Blog Link: https://huggingface.co/blog/KingNish/optimizer-part1
KingNish
posted an update about 2 months ago
Parveshiiii
posted an update 3 months ago
1656
Another banger from XenArcAI! 🔥

We’re thrilled to unveil three powerful new releases that push the boundaries of AI research and development:

🔗 https://huggingface.co/XenArcAI/SparkEmbedding-300m

- A lightning-fast embedding model built for scale.
- Optimized for semantic search, clustering, and representation learning.

🔗 https://huggingface.co/datasets/XenArcAI/CodeX-7M-Non-Thinking

- A massive dataset of 7 million code samples.
- Designed for training models on raw coding patterns without reasoning layers.

🔗 https://huggingface.co/datasets/XenArcAI/CodeX-2M-Thinking

- A curated dataset of 2 million code samples.
- Focused on reasoning-driven coding tasks, enabling smarter AI coding assistants.

Together, these projects represent a leap forward in building smarter, faster, and more capable AI systems.

💡 Innovation meets dedication.
🌍 Knowledge meets responsibility.


Parveshiiii
posted an update 3 months ago
3056
SparkEmbedding - SoTA cross-lingual retrieval

I am very happy to announce our latest embedding model, SparkEmbedding-300m, based on embeddinggemma-300m. We fine-tuned it on 1M extra examples spanning over 119 languages, and the result is a model that achieves exceptional cross-lingual retrieval.

Model: https://huggingface.co/XenArcAI/SparkEmbedding-300m
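
To illustrate what cross-lingual retrieval means in practice: a query and documents in different languages are embedded into the same vector space, and documents are ranked by cosine similarity to the query. The sketch below shows only the ranking step. The three-dimensional vectors and document ids are made up and stand in for real model outputs; it does not load or call the model.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank(query_vec, doc_vecs):
    """Return (doc_id, score) pairs sorted from most to least similar."""
    scored = [(doc_id, cosine(query_vec, vec)) for doc_id, vec in doc_vecs.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Made-up vectors, for illustration only. In a real pipeline each vector
# would come from encoding the text with the embedding model.
doc_vecs = {
    "doc_en_password_reset": [0.9, 0.1, 0.0],
    "doc_es_password_reset": [0.8, 0.3, 0.1],
    "doc_hi_weather": [0.0, 0.2, 0.9],
}
query_vec = [1.0, 0.2, 0.0]  # stands in for an embedded password-reset query

for doc_id, score in rank(query_vec, doc_vecs):
    print(f"{doc_id}: {score:.3f}")
```

With a well-aligned multilingual model, the two password-reset documents would score close together regardless of language, which is the behavior these toy vectors are chosen to mimic.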
kalashshah19
in IndianAIDevs/README 3 months ago

General

26
#5 opened 4 months ago by
kalashshah19
Abhaykoul
in IndianAIDevs/README 3 months ago

General

26
#5 opened 4 months ago by
kalashshah19
Neural-Hacker
in IndianAIDevs/README 3 months ago

General

26
#5 opened 4 months ago by
kalashshah19