feat(v2): implement multi-step environment with PaperState and per-step rewards 1e548bf ianalin123 commited on Mar 8
feat(v2): filter None fields from OrigamiAction in client step payload 781fd4f ianalin123 commited on Mar 8
feat(v2): update train_grpo.py for step-level prompts and per_step_reward 164cb07 ianalin123 commited on Mar 8
feat(v2): update OrigamiAction/Observation/State for multi-step mode 2bb2c9c ianalin123 commited on Mar 8
feat(v2): add step_reward.py — per-step Kawasaki/Maekawa/coverage reward 8484129 ianalin123 commited on Mar 8
feat(v2): add max_folds to tasks + waterbomb_base + map_fold tasks d82e085 ianalin123 commited on Mar 8
feat(v2): add extract_crease_json and valid_crease reward to training/reward.py 5637418 ianalin123 commited on Mar 8
feat(v2): port PaperState to origami_server/engine/paper_state.py 2a7d45d ianalin123 commited on Mar 8
chore(v2): add shapely dependency for PaperState intersection detection 97039f7 ianalin123 commited on Mar 8
feat: Railway deployment + multi-task GRPO + Modal B200 training f7dd892 ianalin123 commited on Mar 8
Add GRPO training notebook + Dockerfile for cloud training (#1) 769b2e8 praveen287 sissississi commited on Mar 8
Rename server/ to origami_server/ to avoid module name conflict with uvicorn.server 3831c6f praveen287 commited on Mar 8