objective,best,learning_rate,num_train_epochs,per_device_train_batch_size,warmup_steps,weight_decay,time_this_iter_s 0.597362118441995,False,3.271499237806267e-05,4,16,1000,0.2565075619288112,144.12740564346313 0.5198963371642762,False,1.0741466184541341e-05,5,16,0,0.07599321456170617,144.151517868042 0.5946714730314795,False,3.37620447413037e-05,3,8,500,0.18998600872372765,155.90862655639648 0.4574393106598997,False,1.4671776366845966e-05,1,16,500,0.21137985700516712,145.89185881614685 0.4709334976994894,False,1.046796947096866e-05,2,16,0,0.026531140748479454,145.94519090652466 0.47061894490157463,False,4.9540747889715425e-05,1,16,1000,0.06647603436044074,145.85856461524963 0.4375637251729597,False,2.580653331424728e-05,1,16,1000,0.07837433494112092,145.91601490974426 0.46680378064714356,False,4.168611764617624e-05,1,16,1000,0.2422358346736178,145.84486627578735 0.5901139866275458,False,2.7145037133288976e-05,3,8,1000,0.17082315068595946,155.86436223983765 0.6201819619499414,True,4.985384913085322e-05,3,8,500,0.20061124234729208,155.91449403762817