🔄 In a Training Loop

Nishith Jain

KingNish

AI & ML interests

AI is fun actually.

Recent Activity

liked a Space 2 days ago

build-small-hackathon/OpenMythos

liked a Space 2 days ago

liked a model 4 days ago

jabbatheduck/OpenMythos-GGUF

upvoted 2 articles 7 days ago

Article

I fine-tuned a model for free from one prompt, with TRL and the Google Colab CLI

sergiopaniego • 12 days ago • 4

Article

Beyond LoRA: Can you beat the most popular fine-tuning technique?

+2

BenjaminB, sayakpaul, hubnemo, kashif • 10 days ago • 62

upvoted an article 11 days ago

Article

How We Built OpenMythos: A Cybersecurity LLM Trained from Scratch

KingNish • 12 days ago • 4

upvoted a paper 4 months ago

OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens

Paper • 2603.02138 • Published Mar 2 • 151

upvoted a changelog 4 months ago

Hugging Face Changelog

Find All Your Blog Drafts in One Place

Feb 2 • 51

upvoted an article 4 months ago

Article

GEM Image: Building an AI That Actually Gets Educational Diagrams Right

AIPreplabs • Feb 21 • 5

upvoted a paper 4 months ago

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

Paper • 2602.12205 • Published Feb 13 • 83

upvoted 3 papers 5 months ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 356

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published Jan 5 • 115

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 190

upvoted an article 5 months ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

LinkedIn • Jan 27 • 80

upvoted a paper 5 months ago

SAMTok: Representing Any Mask with Two Words

Paper • 2601.16093 • Published Jan 22 • 44

upvoted an article 5 months ago

Article

How We Built a Semantic Highlight Model To Save Token Cost for RAG

zilliz • Jan 15 • 67

upvoted an article 6 months ago

Article

The Optimal Architecture for Small Language Models

codelion • Dec 26, 2025 • 121

upvoted a paper 6 months ago

Nested Learning: The Illusion of Deep Learning Architectures

Paper • 2512.24695 • Published Dec 31, 2025 • 46

upvoted an article 6 months ago

Article

Decoding Strategies in Large Language Models

mlabonne • Oct 29, 2024 • 114

upvoted 3 articles 7 months ago

Article

New in llama.cpp: Model Management

ggml-org • Dec 11, 2025 • 137

Article

Muon vs MuonClip vs Muon+AdamW for Fine-Tuning

KingNish • Dec 9, 2025 • 14

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

qgallouedec • Dec 4, 2025 • 72

upvoted a paper 7 months ago

PretrainZero: Reinforcement Active Pretraining

Paper • 2512.03442 • Published Dec 3, 2025 • 50