feat: add OpenEnv TRL wrapper, expand dataset, and add W&B eval tracking 6fa4fbd Mohammed-Altaf commited on Apr 25
feat: add structured pruning action and random baseline policy d064b19 Mohammed-Altaf commited on Apr 25
refactor: harden imports, add training extras, and rewrite README 5dd60b9 Mohammed-Altaf commited on Apr 25
implement NeuralTuner RL environment for Snapdragon quantization 782222a Mohammed-Altaf commited on Apr 25