integrate production simulator, stability math, and reward recalibration 654c8c7 PranavKK1201 commited on Mar 27