Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhiyuan He's picture
1

Zhiyuan He

nickhe
ยท
  • nichezy

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking
published a model 3 months ago
nickhe/firl-ckpt-720
published a model 3 months ago
nickhe/firl-ckpt-760
View all activity

Organizations

University College London's profile picture

nickhe 's collections 1

FIRL-Abalone-REINFORCE++
Saved LORA adapter checkpoints from training Qwen2.5-7B to generate decision trees for Abalone age regression dataset, using reinforce++ algorithm.
  • nickhe/firl-ckpt-20-40-60

    Updated Jan 23
  • nickhe/firl-ckpt-100

    Updated Jan 23
  • nickhe/firl-ckpt-120

    Updated Jan 23
  • nickhe/firl-ckpt-140

    Updated Jan 23
FIRL-Abalone-REINFORCE++
Saved LORA adapter checkpoints from training Qwen2.5-7B to generate decision trees for Abalone age regression dataset, using reinforce++ algorithm.
  • nickhe/firl-ckpt-20-40-60

    Updated Jan 23
  • nickhe/firl-ckpt-100

    Updated Jan 23
  • nickhe/firl-ckpt-120

    Updated Jan 23
  • nickhe/firl-ckpt-140

    Updated Jan 23
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs