AI & ML interests

None defined yet.

Recent Activity

JixuanLeng

updated a model 23 days ago

RLLab/Qwen3-1.7B-Base-GRPO

Text Generation • 2B • Updated 23 days ago • 170

JixuanLeng

published a model 23 days ago

RLLab/Qwen3-1.7B-Base-GRPO

Text Generation • 2B • Updated 23 days ago • 170

JixuanLeng

updated a collection 4 months ago

DPO

Collection

4 items • Updated Mar 3

JixuanLeng

updated a model 4 months ago

RLLab/gemma-3-4b-text-pt

Text Generation • 4B • Updated Mar 1 • 6

JixuanLeng

updated a collection 4 months ago

DPO

Collection

4 items • Updated Mar 3

JixuanLeng

updated a dataset 4 months ago

RLLab/allenai-Dolci-Instruct-DPO-Length-Filtered

Viewer • Updated Mar 1 • 146k • 3

JixuanLeng

updated a model 4 months ago

RLLab/gemma-3-4b-text-sft

4B • Updated Feb 28 • 3

JixuanLeng

published a model 4 months ago

RLLab/gemma-3-4b-text-sft

4B • Updated Feb 28 • 3

JixuanLeng

updated a collection 4 months ago

DPO

Collection

4 items • Updated Mar 3

JixuanLeng

authored a paper about 1 year ago

CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation

Paper • 2504.00043 • Published Mar 30, 2025 • 10

JixuanLeng

authored 2 papers over 1 year ago

S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity

Paper • 2412.06289 • Published Dec 9, 2024

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Paper • 2410.09724 • Published Oct 13, 2024 • 3

Team members 1