NeuLab @ LTI/CMU

university

https://www.cs.cmu.edu/~neulab/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

seungone authored a paper about 22 hours ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

seungone authored a paper about 22 hours ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Nyandwi updated a dataset 6 days ago

neulab/behavioral-lift

View all activity

authored 2 papers about 22 hours ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Paper • 2603.18886 • Published Mar 19 • 6

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published 5 days ago • 71

updated a dataset 6 days ago

neulab/behavioral-lift

Viewer • Updated 6 days ago • 15.3k • 124 • 1

published a dataset 6 days ago

neulab/behavioral-lift

Viewer • Updated 6 days ago • 15.3k • 124 • 1

authored a paper 28 days ago

IDIOLEX: Unified and Continuous Representations for Idiolectal and Stylistic Variation

Paper • 2604.04704 • Published Apr 6

updated a model about 1 month ago

neulab/codescout-14b-strict-k-turns-150

15B • Updated Mar 31 • 1

published a model about 1 month ago

neulab/codescout-14b-strict-k-turns-150

15B • Updated Mar 31 • 1

updated a model about 1 month ago

neulab/codescout-14b-strict-k-turns

15B • Updated Mar 31 • 1

published a model about 1 month ago

neulab/codescout-14b-strict-k-turns

15B • Updated Mar 31 • 1

in neulab/VisualPuzzles about 2 months ago

fix inconsistent data split name

#2 opened about 2 months ago by

updated a dataset about 2 months ago

neulab/VisualPuzzles

Viewer • Updated Mar 27 • 1.17k • 2.39k • 11

authored a paper about 2 months ago

Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning

Paper • 2601.18722 • Published Jan 26

updated a dataset 2 months ago

neulab/agent-data-collection

Preview • Updated Mar 9 • 5.86k • 112

updated a model 2 months ago

neulab/adversarial-paraphraser-qwen3-8b

8B • Updated Mar 6 • 26 • 3

published a model 2 months ago

neulab/adversarial-paraphraser-qwen3-8b

8B • Updated Mar 6 • 26 • 3

updated a dataset 3 months ago

neulab/proxy-traj

Preview • Updated Feb 24 • 51

published a dataset 3 months ago

neulab/proxy-traj

Preview • Updated Feb 24 • 51

updated a model 3 months ago

neulab/SP3F-7B

Text Generation • 8B • Updated Feb 2 • 4 • 2

updated a model 4 months ago

neulab/cso-distilled-1.7b

Updated Jan 29 • 1

published a model 4 months ago

neulab/cso-distilled-1.7b

Updated Jan 29 • 1