In a Training Loop 🔄
Void
Parveshiiii
149 followers · 34 following
parveshiiii
AI & ML interests
I love deep neural nets.
Recent Activity
reacted to their post with 🔥 about 10 hours ago
🚀 Wanna train your own AI Model or Tokenizer from scratch?

Building models isn't just for big labs anymore. With the right data, compute, and workflow, you can create **custom AI models** and **tokenizers** tailored to any domain. Whether it's NLP, domain‑specific datasets, or experimental architectures, training from scratch gives you full control over vocabulary and embeddings, and lets you tune performance for your own use case.

✨ Why train your own?
- Full control over vocabulary & tokenization
- Domain‑specific optimization (medical, legal, technical, etc.)
- Potentially better performance on niche datasets
- Freedom to experiment with architectures

⚡ The best part?
- Tokenizer training (TikToken / BPE) can be done in **just 3 lines of code**.
- Model training runs smoothly on **Google Colab notebooks**, with no expensive hardware required.

📂 Try out my work:
- 🔗 https://github.com/OE-Void/Tokenizer-from_scratch
- 🔗 https://github.com/OE-Void/GPT
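
For readers wondering what BPE tokenizer training involves, here is a minimal byte-level sketch in plain Python. It is not the code from the repositories linked in the post and it does not use TikToken's API; the function names (`train_bpe`, `merge_pair`, `encode`, `decode`) are hypothetical and chosen only for illustration. It assumes the simplest variant of the algorithm: start from raw UTF-8 bytes and repeatedly merge the most frequent adjacent pair into a new token.

```python
from collections import Counter


def merge_pair(ids, pair, new_id):
    """Replace every non-overlapping occurrence of `pair` in `ids` with `new_id`."""
    out = []
    i = 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out


def train_bpe(text, num_merges):
    """Learn up to `num_merges` merges; returns {(left_id, right_id): new_id}."""
    # The base vocabulary is the 256 possible byte values.
    ids = list(text.encode("utf-8"))
    merges = {}
    for _ in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        pair, count = pairs.most_common(1)[0]
        if count < 2:
            break  # no pair repeats, so another merge would not compress anything
        new_id = 256 + len(merges)
        merges[pair] = new_id
        ids = merge_pair(ids, pair, new_id)
    return merges


def encode(text, merges):
    """Tokenize `text` by replaying the merges in the order they were learned."""
    ids = list(text.encode("utf-8"))
    for pair, new_id in merges.items():  # dicts preserve insertion order
        ids = merge_pair(ids, pair, new_id)
    return ids


def decode(ids, merges):
    """Map token ids back to text by expanding each merged token into its bytes."""
    vocab = {i: bytes([i]) for i in range(256)}
    for (left, right), new_id in merges.items():
        vocab[new_id] = vocab[left] + vocab[right]
    return b"".join(vocab[i] for i in ids).decode("utf-8", errors="replace")
```

With these helpers, the train, encode, and decode steps are one call each, for example `merges = train_bpe(corpus, 1000)`, then `ids = encode("some text", merges)`, then `decode(ids, merges)`. A production tokenizer would normally add pre-tokenization rules and special tokens, which this sketch leaves out.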
posted an update about 11 hours ago
liked a dataset about 12 hours ago
Modotte/MathX-20M
Parveshiiii's datasets (4)
Parveshiiii/Complete-it • Updated Oct 2, 2025 • 190k • 32 • 2
Parveshiiii/AI-vs-Real • Updated Sep 25, 2025 • 14k • 295 • 5
Parveshiiii/Embedder • Updated Sep 22, 2025 • 990k • 15 • 2
Parveshiiii/opencode_reasoning_filtered • Updated Jul 8, 2025 • 568k • 74 • 4