Zhiqing Sun's picture

Zhiqing Sun

zhiqings

·

https://www.cs.cmu.edu/~zhiqings/

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning

authored a paper over 1 year ago

An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models

authored a paper over 1 year ago

Lean-STaR: Learning to Interleave Thinking and Proving

View all activity

Organizations

upvoted a paper 2 days ago

Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning

Paper • 2605.22642 • Published 3 days ago • 32

authored 2 papers over 1 year ago

An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models

Paper • 2408.00724 • Published Aug 1, 2024 • 2

Lean-STaR: Learning to Interleave Thinking and Proving

Paper • 2407.10040 • Published Jul 14, 2024

updated 2 datasets almost 2 years ago

UCLAML/synthetic_data_mistral-7b-instruct-sppo-iter1_score

Viewer • Updated Aug 1, 2024 • 510 • 60

UCLAML/data-mistral-7b-instruct-sppo-iter1_generated

Viewer • Updated Aug 1, 2024 • 10 • 16

updated a collection almost 2 years ago

Lean-STaR

8 items • Updated Jul 13, 2024 • 1

liked a dataset almost 2 years ago

nvidia/HelpSteer

Viewer • Updated Dec 18, 2024 • 37.1k • 2.43k • 248

liked a model almost 2 years ago

nvidia/Llama2-70B-SteerLM-Chat

Text Generation • Updated Jan 4, 2024 • 2 • 23

liked a dataset about 2 years ago

allenai/WildChat-1M

Viewer • Updated Oct 17, 2024 • 838k • 14.1k • 435

authored a paper about 2 years ago

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Paper • 2403.09472 • Published Mar 14, 2024 • 1