RLLab

https://github.com
Activity Feed

JixuanLeng 
updated a collection 2 months ago

DPO

Collection
4 items • Updated Mar 3
JixuanLeng 
updated a model 2 months ago

RLLab/gemma-3-4b-text-pt

Text Generation • 4B • Updated Mar 1 • 4
JixuanLeng 
updated a dataset 2 months ago

RLLab/allenai-Dolci-Instruct-DPO-Length-Filtered

Viewer • Updated Mar 1 • 146k • 5
JixuanLeng 
updated a model 2 months ago

RLLab/gemma-3-4b-text-sft

Text Generation • 4B • Updated Feb 28 • 24
JixuanLeng 
published a model 2 months ago

RLLab/gemma-3-4b-text-sft

Text Generation • 4B • Updated Feb 28 • 24
JixuanLeng 
authored a paper about 1 year ago

CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation

Paper • 2504.00043 • Published Mar 30, 2025 • 10
JixuanLeng 
authored 2 papers over 1 year ago

S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity

Paper • 2412.06289 • Published Dec 9, 2024

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Paper • 2410.09724 • Published Oct 13, 2024 • 3