Ujjwal Tyagi

Ujjwal-Tyagi

AI & ML interests

Chief Scientist at Shirova AI, focused on advancing open-source AI. Experienced in LLM fine-tuning, model architecture, and research, with a strong interest in building scalable and efficient models.

Recent Activity

liked a model 1 day ago

Zyphra/ZAYA1-8B

9B • Updated 3 days ago • 66.1k • 399

liked a dataset 1 day ago

1aurent/BACH

Viewer • Updated May 25, 2024 • 462 • 569 • 1

liked a model 1 day ago

houyuanchen/UniVidX

Updated 7 days ago • 16

replied to DedeProGames's post 1 day ago

Oh wow, good

upvoted a paper 2 days ago

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

Paper • 2605.06651 • Published 4 days ago • 10

upvoted 2 collections 5 days ago

RL+reason model

Collection

259 items • Updated 15 days ago • 1

Research Paper Categories

Collection

25 items • Updated 15 days ago • 1

liked a model 7 days ago

XiaomiMiMo/MiMo-V2.5-Pro

Text Generation • 1T • Updated 3 days ago • 41.7k • 506

liked a dataset 8 days ago

nvidia/Nemotron-Image-Training-v3

Viewer • Updated 13 days ago • 6.92M • 5.73k • 60

posted an update 8 days ago

6 Open-Source Libraries to Fine-Tune LLMs
1. Unsloth
GitHub: https://github.com/unslothai/unsloth
→ One of the fastest ways to fine-tune LLMs locally
→ Optimized for low VRAM (even laptop GPUs)
→ Plug-and-play with Hugging Face models
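To make the low-VRAM point concrete, here is a rough back-of-the-envelope sketch. It assumes the savings come mainly from loading weights in 4-bit instead of 16-bit, which the post does not state explicitly, and it counts weights only (no activations, optimizer state, or quantization overhead). The 7-billion-parameter size and the helper name `weight_memory_gb` are hypothetical, picked only for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Hypothetical 7-billion-parameter model, chosen only for illustration.
params = 7e9

fp16_gb = weight_memory_gb(params, 16)  # 16-bit weights
int4_gb = weight_memory_gb(params, 4)   # 4-bit quantized weights (assumption)

print(f"16-bit weights: {fp16_gb:.1f} GB")
print(f"4-bit weights:  {int4_gb:.1f} GB")
```

Real usage will be higher than these numbers, but the 4x gap in weight memory is the main reason a quantized model can fit on a small GPU.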

2. Axolotl
GitHub: https://github.com/OpenAccess-AI-Collective/axolotl
→ Flexible LLM fine-tuning configs
→ Supports LoRA, QLoRA, multi-GPU
→ Great for custom training pipelines

3. TRL (Transformer Reinforcement Learning)
GitHub: https://github.com/huggingface/trl
→ RLHF, DPO, PPO for LLM alignment
→ Built on Hugging Face ecosystem
→ Essential for post-training optimization
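For a feel of what DPO optimizes, here is a minimal standard-library sketch of the per-example DPO loss. It is not TRL's API or implementation, just the published objective written out by hand; the function name `dpo_loss`, the log-probability values, and `beta=0.1` are illustrative assumptions.

```python
import math


def dpo_loss(policy_chosen_logp: float, policy_rejected_logp: float,
             ref_chosen_logp: float, ref_rejected_logp: float,
             beta: float = 0.1) -> float:
    """Per-example DPO loss: -log(sigmoid(beta * (policy margin - reference margin)))."""
    policy_margin = policy_chosen_logp - policy_rejected_logp
    ref_margin = ref_chosen_logp - ref_rejected_logp
    logits = beta * (policy_margin - ref_margin)
    # -log(sigmoid(x)) is the same as log(1 + exp(-x))
    return math.log1p(math.exp(-logits))


# Policy identical to the reference: loss equals log(2).
baseline = dpo_loss(-1.0, -2.0, -1.0, -2.0)

# Policy prefers the chosen answer more strongly than the reference: lower loss.
improved = dpo_loss(-1.0, -3.0, -1.0, -2.0)

print(baseline, improved)
```

The loss only goes down when the policy widens the gap between chosen and rejected answers relative to the frozen reference model, which is what keeps training anchored to the starting model.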

4. DeepSpeed
GitHub: https://github.com/microsoft/DeepSpeed
→ Train massive models efficiently
→ Memory + speed optimization
→ Widely used for scaling training across many GPUs
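DeepSpeed is driven by a JSON config, so a small sketch may help. This one assumes ZeRO stage 2 with optimizer state offloaded to CPU, a common memory-saving setup that the post itself does not mention. The batch size and accumulation values are placeholders, not recommendations.

```python
import json

# Illustrative values only; tune them for your own hardware and model.
ds_config = {
    "train_micro_batch_size_per_gpu": 1,
    "gradient_accumulation_steps": 8,
    "bf16": {"enabled": True},
    "zero_optimization": {
        "stage": 2,  # shard optimizer state and gradients across GPUs
        "offload_optimizer": {"device": "cpu"},  # keep optimizer state in CPU RAM
    },
}

# Serialize to the JSON text that would be saved as a DeepSpeed config file.
ds_config_json = json.dumps(ds_config, indent=2)
print(ds_config_json)
```

The printed JSON can be saved to a file and passed to a DeepSpeed-enabled training script; check the DeepSpeed docs for the options your version supports.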

5. LLaMA-Factory
GitHub: https://github.com/hiyouga/LLaMA-Factory
→ All-in-one fine-tuning UI + CLI
→ Supports multiple models (LLaMA, Qwen, etc.)
→ Beginner-friendly + powerful

6. PEFT
GitHub: https://github.com/huggingface/peft
→ Fine-tune with minimal compute
→ LoRA, adapters, prefix tuning
→ Best for cost-efficient training
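Why LoRA needs so little compute comes down to simple arithmetic. The sketch below counts trainable parameters for a single linear layer, assuming a 4096 x 4096 weight and rank 8 (both hypothetical values for illustration). It does not call PEFT; `lora_trainable_params` is a helper made up for this example.

```python
def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """LoRA adds two low-rank matrices: A (rank x d_in) and B (d_out x rank)."""
    return rank * (d_in + d_out)


# Hypothetical layer size and rank, chosen only for illustration.
d_in = d_out = 4096
rank = 8

full = d_in * d_out                              # parameters in the frozen weight
lora = lora_trainable_params(d_in, d_out, rank)  # parameters LoRA actually trains
fraction = lora / full

print(f"full: {full:,}  lora: {lora:,}  trainable fraction: {fraction:.2%}")
```

Under these assumptions LoRA trains well under 1% of the layer's parameters, and the original weight stays frozen.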