quinn

jwhe

·

AI & ML interests

None yet

Organizations

New activity in harborframework/parity-experiments 2 months ago

[Parity] CL-bench: codex/gpt-5.2 vs infer_codex.py (50 tasks, 3 trials, MATCHING)

#230 opened 2 months ago by

New activity in harborframework/parity-experiments 3 months ago

[Parity] CL-bench: codex/gpt-5.1 vs original pipeline (50 tasks, 3 trials)

#210 opened 3 months ago by