Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Violet Xiang's picture
6 4 1

Violet Xiang PRO

violetxi
AlgoDistill's profile picture John6666's profile picture
·
  • violetxi

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
GLM-5: from Vibe Coding to Agentic Engineering
updated a model 3 days ago
violetxi/opd_tooluse_qwen3-4b_trained_teacher_forward_kl_bs256_lr5e-6
published a model 3 days ago
violetxi/opd_tooluse_qwen3-4b_trained_teacher_forward_kl_bs256_lr5e-6
View all activity

Organizations

RLAIF's profile picture SynthLabs's profile picture Stanford University's profile picture Stanford Autonomous Agent Lab's profile picture Radical Numerics's profile picture ST Projects's profile picture

violetxi 's collections 1

ExpRL
Trained ExpRL checkpoints. Paper link: https://arxiv.org/abs/2606.17024
  • violetxi/ExpRL-Outcome-Qwen3-4B-Instruct

    4B • Updated 4 days ago • 8
  • violetxi/ExpRL-Process-Qwen3-4B-Instruct

    4B • Updated 4 days ago • 14
  • violetxi/ExpRL-Outcome-Qwen3-8B

    8B • Updated 4 days ago • 16
  • ExpRL: Exploratory RL for LLM Mid-Training

    Paper • 2606.17024 • Published 6 days ago • 4
ExpRL
Trained ExpRL checkpoints. Paper link: https://arxiv.org/abs/2606.17024
  • violetxi/ExpRL-Outcome-Qwen3-4B-Instruct

    4B • Updated 4 days ago • 8
  • violetxi/ExpRL-Process-Qwen3-4B-Instruct

    4B • Updated 4 days ago • 14
  • violetxi/ExpRL-Outcome-Qwen3-8B

    8B • Updated 4 days ago • 16
  • ExpRL: Exploratory RL for LLM Mid-Training

    Paper • 2606.17024 • Published 6 days ago • 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs