feat: add benchmark.py with full performance baseline e8e3c0f remdms Claude Opus 4.6 commited on Mar 30