refactor: move training code to scripts/, add train/eval split, tune GRPO hyperparams fad16c9 Mohammed-Altaf commited on 19 days ago
feat: add OpenEnv TRL wrapper, expand dataset, and add W&B eval tracking 6fa4fbd Mohammed-Altaf commited on 19 days ago
feat: add structured pruning action and random baseline policy d064b19 Mohammed-Altaf commited on 19 days ago
refactor: harden imports, add training extras, and rewrite README 5dd60b9 Mohammed-Altaf commited on 19 days ago
fix: resolve merge conflict markers in openenv.yaml and uv.lock c524b25 Mohammed-Altaf commited on 20 days ago
implement NeuralTuner RL environment for Snapdragon quantization 782222a Mohammed-Altaf commited on 20 days ago