refactor: update environment task loading logic and add complexity reasoning datasets c5cf3c7 TM23-sanji committed on Apr 8
refactor: update judge model to Qwen2.5-72B, standardize environment variables, and enhance task configuration schema 2d3fda8 TM23-sanji committed on Apr 8
feat: initialize complexity reasoning datasets and update server app logic 1a23a65 TM23-sanji committed on Apr 8
feat: reduce inference sample size and add complexity reasoning results dataset 3bc08e3 TM23-sanji committed on Apr 8
refactor: update inference defaults, standardize openenv.yaml schema, and overhaul submission validation script c680956 TM23-sanji committed on Apr 8
refactor: add main entry point wrapper to server and initialize complexity reasoning datasets fd1410f TM23-sanji committed on Apr 8
refactor: update app.py re-exports and add initial complexity reasoning datasets b09b0de TM23-sanji committed on Apr 8
feat: add server entry point, initialize complexity reasoning datasets, and update task configuration with null graders 59cdc6a TM23-sanji committed on Apr 8
feat: add brace-counting fallback for Rust solution extraction and initialize result datasets d9bc7db TM23-sanji committed on Apr 8
feat: implement difficulty-based reward weighting and update environment configuration schema 347ee2e TM23-sanji committed on Apr 8
refactor: migrate inference script to use HF Space HTTP endpoints and update logging format 5318de1 TM23-sanji committed on Apr 7
chore: update data directory path and synchronize project dependencies d17eaec TM23-sanji committed on Apr 7
feat: allow passing a custom temporary directory to compile_rust_code to persist binaries for testing 23167e9 TM23-sanji committed on Apr 7
build: install gcc and libc6-dev in Dockerfile to support Rust linking 7e1e644 TM23-sanji committed on Apr 7
feat: add _strip_impl_wrapper to extract function bodies from model-generated impl blocks during code assembly 048c0a4 TM23-sanji committed on Apr 7
fix: prepend missing struct Solution definition to assembled code when placeholder is used 2f8318e TM23-sanji committed on Apr 7
refactor: update inference script to support multi-problem evaluation and synchronous execution 01ca815 TM23-sanji committed on Apr 7
feat: initialize FastAPI app and simplify landing page HTML documentation 3f25477 TM23-sanji committed on Apr 7
feat: add interactive landing page with environment documentation to the server root endpoint 48fa334 TM23-sanji committed on Apr 7
refactor: simplify app initialization by passing AlgoReasoningEnvironment class directly to create_openenv_app 4dab815 TM23-sanji committed on Apr 7
feat: add README and update Dockerfiles to install algo_reasoning_env via pyproject.toml ee0ec10 TM23-sanji committed on Apr 7
feat: add app.py entry point, configure Docker for local package installation, and update README for HuggingFace Spaces deployment dd80bb4 TM23-sanji committed on Apr 7