Commit History

Refine GRPO evaluation details and clarify model performance comparisons in Blog.md
83f7214
Running

thepikachu commited on

Revise reasoning for choosing SFT + agentic loop over SFT + GRPO in deployment documentation
94c19b9

thepikachu commited on

Remove placeholder for YouTube demo link in README.md
1ddc10a

thepikachu commited on

Update README.md to correct title and enhance color scheme, app configuration, and metadata
b8672db

thepikachu commited on

Add initial blog post detailing ArchitectureEnv and its design approach
29c929e

thepikachu commited on

Add front matter to README.md for enhanced metadata and project visibility
1cbc836

thepikachu commited on

Update README.md for improved clarity and structure, including quick start instructions and enhanced project description.
26e1800

thepikachu commited on

Add analysis of SFT vs GRPO performance and rationale for model selection
c7bd5c9

thepikachu commited on

Remove outdated inference reward curve plot and add new loss and reward curve plots for improved analysis.
ebba8ad

thepikachu commited on

Add inference reward curve plot
fa5f65c

thepikachu commited on

Update LocalModelClient initialization to use MODEL_REPO_ID instead of MODEL_DIR
1c83fd8

thepikachu commited on

Update README.md with installation instructions and modify agentic_inference.py to print MODEL_REPO_ID instead of MODEL_DIR
0ede54d

thepikachu commited on

updated inference to use the model deployed on sft
842a4f8

thepikachu commited on

moved app.py to fix the runtime error
157160f

thepikachu commited on

Add scripts for supervised fine-tuning and GRPO training
adcad94

thepikachu commited on

round2: inference and planner critic design
786f4c0

thepikachu commited on

Update inference.py
7d3618b
verified

thepikachu commited on

Update inference.py
68aae9a
verified

thepikachu commited on

Update inference.py
09b98c3
verified

thepikachu commited on

Update inference.py
3798f67
verified

thepikachu commited on

Update inference.py
fab8166
verified

thepikachu commited on

Update inference.py
4f94384
verified

thepikachu commited on

Update inference.py
4d83ca3
verified

thepikachu commited on

Update inference.py
b0775a5
verified

thepikachu commited on

Refactor code structure for improved readability and maintainability
19e79af

thepikachu commited on

Add metadata section to README.md for project details
2eacea9

thepikachu commited on

Revise README.md for clarity and structure; update benchmark description and task details
7731b28

thepikachu commited on

Simplify main function by removing host and port arguments in app.py
d932af4

thepikachu commited on

Update error handling for missing HF_TOKEN in inference.py
5d3f541

thepikachu commited on

Refactor requirements file structure
401a2d6

thepikachu commited on

Update inference.py
b4230bf
verified

thepikachu commited on

Fixed architecture step function
abc93f5
verified

thepikachu commited on

Update models.py
c3112a9
verified

thepikachu commited on

Update README.md
77fcd5f
verified

thepikachu commited on

Update server/app.py
27202c4
verified

thepikachu commited on

Update Dockerfile
b746fc2
verified

thepikachu commited on

Updated docker server config
2e4fc24
verified

thepikachu commited on

Update server/app.py
bd60fcd
verified

thepikachu commited on

Update openenv.yaml
6a2868c
verified

thepikachu commited on

Fixed deployment bugs
2c87f99
verified

thepikachu commited on

Upload Dockerfile
fc66ab6
verified

thepikachu commited on

Delete server/Dockerfile
5a8383f
verified

thepikachu commited on

Upload 4 files
3544a3b
verified

thepikachu commited on

Create server/app.py
9c2e75a
verified

thepikachu commited on

Delete server
82ff700
verified

thepikachu commited on

Create server/
cb5582a
verified

thepikachu commited on

Upload 11 files
41da5bf
verified

thepikachu commited on

initial commit
859f7f9
verified

thepikachu commited on