Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Vaibhavi Singh's picture
3

Vaibhavi Singh

contactvaibhavi
·
https://www.vaibhavisingh.com/
  • __Vaibhavi
  • contactvaibhavi
  • contactvaibhavi

AI & ML interests

NLP

Organizations

Hugging Face Discord Community's profile picture Stealth's profile picture

upvoted a collection 7 months ago

Reward Models 06-2025

Collection
Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 1 day ago • 23
upvoted 2 collections 8 months ago

OLMo-1B-as_fm3_tg_omi2

Collection
OLMo 1B model pretrained with Algebraic Stack, FineMath3, TinyGSM, and OpenMathInstruct2. Includes checkpoints from doing PPO using GSM8K train. • 25 items • Updated 5 days ago • 1

OLMo-1B-as_fm3_tg_omi1_omi2

Collection
OLMo 1B model pretrained with Algebraic Stack, FineMath3, TinyGSM, OMI1, and OMI2. Includes checkpoints from doing PPO using GSM8K train. • 25 items • Updated 5 days ago • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs